Behavior Knowledge Merge in Reinforced Agentic Models • Paper 2601.13572 • Published 14 days ago • 23
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting • Paper 2601.02151 • Published 28 days ago • 104