DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published Oct 24, 2025 • 101
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning Paper • 2505.24850 • Published May 30, 2025 • 8