CodeScout
RL-trained code search agents (1.7B, 4B, 14B) that outperform 2–18× larger models using only a Unix terminal. arxiv.org/abs/2603.17829

OpenHands/CodeScout-14B
Text Generation • 15B • Updated • 82 • 2
Note: CodeScout-14B – strongest model, SOTA on SWE-Bench Verified/Pro/Lite
OpenHands/CodeScout-4B
Text Generation • 4B • Updated • 88
Note: CodeScout-4B – outperforms 8× larger Qwen3-32B across all benchmarks
OpenHands/CodeScout-1.7B
Text Generation • 2B • Updated • 83 • 1
Note: CodeScout-1.7B – post-RL checkpoint, outperforms 8× larger Qwen3-14B
OpenHands/CodeScout-1.7B-RFT
Text Generation • 2B • Updated • 81 • 1
Note: CodeScout-1.7B-RFT – pre-RL (rejection fine-tuned) checkpoint
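The four model repos listed above can be pulled straight from the Hub. A minimal sketch, assuming the `transformers` library is installed and the repos are publicly downloadable (device placement and dtype are illustrative choices, not the authors' documented setup):

```python
# Hedged sketch: loading a CodeScout checkpoint from the Hugging Face Hub.
# Repo ids are taken from the listing above; the helper itself is illustrative.
CODESCOUT_REPOS = {
    "14b": "OpenHands/CodeScout-14B",        # strongest model
    "4b": "OpenHands/CodeScout-4B",
    "1.7b": "OpenHands/CodeScout-1.7B",      # post-RL checkpoint
    "1.7b-rft": "OpenHands/CodeScout-1.7B-RFT",  # pre-RL baseline
}

def load_codescout(size: str = "4b"):
    """Load tokenizer and model for one CodeScout size (requires `transformers`)."""
    from transformers import AutoModelForCausalLM, AutoTokenizer  # lazy import
    repo = CODESCOUT_REPOS[size]
    tokenizer = AutoTokenizer.from_pretrained(repo)
    model = AutoModelForCausalLM.from_pretrained(repo, device_map="auto")
    return tokenizer, model
```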
OpenHands/CodeScout_Training_Rollouts
Viewer • Updated • 54.8k • 13 • 1
Note: Training rollouts from SWE-Smith environments
OpenHands/CodeScout_Eval_Rollouts
Viewer • Updated • 12.7k • 12
Note: Evaluation trajectories on SWE-Bench Verified, Pro, and Lite
OpenHands/SWE-smith-py-code-search
Viewer • Updated • 39.3k • 7
Note: SWE-Smith code search localization targets
OpenHands/SWE-Gym-code-search
Viewer • Updated • 2.32k • 8
Note: SWE-Gym code search localization targets
OpenHands/SWE-rebench-code-search
Viewer • Updated • 17.6k • 8
Note: SWE-rebench code search localization targets
OpenHands/SWE-bench_Verified-locagent
Viewer • Updated • 500 • 11
Note: SWE-Bench Verified – localization ground truth
OpenHands/SWE-bench_Lite-locagent
Viewer • Updated • 300 • 8
Note: SWE-Bench Lite – localization ground truth
OpenHands/SWE-bench_Pro-locagent
Viewer • Updated • 264 • 8
Note: SWE-Bench Pro – localization ground truth
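The three localization ground-truth datasets can be fetched with the `datasets` library. A minimal sketch, assuming the repos are public and expose a `train` split (the split name and instance counts in the comments are taken from the listing above; the helper is illustrative):

```python
# Hedged sketch: loading CodeScout localization ground-truth sets from the Hub.
# Repo ids come from the listing above; split name "train" is an assumption.
LOCALIZATION_SETS = {
    "verified": "OpenHands/SWE-bench_Verified-locagent",  # 500 instances
    "lite": "OpenHands/SWE-bench_Lite-locagent",          # 300 instances
    "pro": "OpenHands/SWE-bench_Pro-locagent",            # 264 instances
}

def load_localization_targets(benchmark: str):
    """Fetch the localization ground-truth dataset for one benchmark
    (requires `pip install datasets`)."""
    from datasets import load_dataset  # lazy import
    return load_dataset(LOCALIZATION_SETS[benchmark], split="train")
```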