Bridging the Long-Term Gap: A Memory-Active Policy for Multi-Session Task-Oriented Dialogue Paper • 2505.20231 • Published May 26, 2025
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning Paper • 2508.19996 • Published Aug 27, 2025
Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents Paper • 2512.20092 • Published 11 days ago • 8