MemPO: Self-Memory Policy Optimization for Long-Horizon Agents Paper • 2603.00680 • Published Apr 9 • 1