Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 17
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B Text Generation • 15B • Updated Feb 24, 2025 • 510k • • 632
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF Text Generation • 31B • Updated Jan 30 • 149k • 594
Qwen/Qwen3-Coder-30B-A3B-Instruct Text Generation • 31B • Updated Dec 3, 2025 • 2.16M • • 1.02k