tiny-reasoning LLM trained via RL for reasoning tasks.

CaptainHPY/Qwen2.5-7B-R1-Zero • Text Generation • 5B • Updated Sep 16, 2025
CaptainHPY/Qwen2.5-7B-R1 • Text Generation • 5B • Updated Sep 17, 2025 • 3
CaptainHPY/Qwen2.5-7B-R1-Zero-GGUF • Text Generation • 8B • Updated Sep 17, 2025 • 9
CaptainHPY/Qwen2.5-7B-R1-GGUF • Text Generation • 8B • Updated Sep 18, 2025 • 16