SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 2 days ago • 46
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Paper • 2503.21380 • Published Mar 27, 2025 • 38 • 4
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Paper • 2503.21380 • Published Mar 27, 2025 • 38
An Empirical Study on Eliciting and Improving R1-like Reasoning Models Paper • 2503.04548 • Published Mar 6, 2025 • 9
JiuZhang3.0-Corpus Collection Corpura for training JiuZhang3.0 • 3 items • Updated May 24, 2024 • 1