Open-Reasoner-Zero/Open-Reasoner-Zero-32B Reinforcement Learning • 33B • Updated Apr 7, 2025 • 27 • 33
Open-Reasoner-Zero/Open-Reasoner-Zero-7B Reinforcement Learning • 8B • Updated Apr 7, 2025 • 130 • 33
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-32B Reinforcement Learning • 32B • Updated Apr 7, 2025 • 15 • 6
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-7B Reinforcement Learning • 7B • Updated Apr 7, 2025 • 12 • 1
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-1.5B Reinforcement Learning • 2B • Updated Apr 6, 2025 • 5 • 1
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-0.5B Reinforcement Learning • 0.5B • Updated Apr 7, 2025 • 11