Locas: Your Models are Principled Initializers of Locally-Supported Parametric Memories Paper • 2602.05085 • Published 12 days ago • 4
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning Paper • 2512.15687 • Published Dec 17, 2025 • 20