Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 21 days ago • 252
MOA: Multi-Objective Alignment for Role-Playing Agents Paper • 2512.09756 • Published Dec 10, 2025 • 5
TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence Paper • 2505.24500 • Published May 30, 2025 • 12