STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? Paper • 2605.06527 • Published 10 days ago • 37 • 2
WildTableBench: Benchmarking Multimodal Foundation Models on Table Understanding In the Wild Paper • 2605.01018 • Published 16 days ago • 6 • 2
RouteProfile: Elucidating the Design Space of LLM Profiles for Routing Paper • 2605.00180 • Published 17 days ago • 27 • 2
Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding Paper • 2605.07637 • Published 5 days ago • 16 • 3
Long Context Pre-Training with Lighthouse Attention Paper • 2605.06554 • Published 10 days ago • 18 • 2
Federation of Experts: Communication Efficient Distributed Inference for Large Language Models Paper • 2605.06206 • Published 10 days ago • 1 • 2
Retrieval from Within: An Intrinsic Capability of Attention-Based Models Paper • 2605.05806 • Published 9 days ago • 5 • 2
KL for a KL: On-Policy Distillation with Control Variate Baseline Paper • 2605.07865 • Published 9 days ago • 17 • 3
AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents Paper • 2605.06607 • Published 5 days ago • 1 • 2
SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety Paper • 2605.05704 • Published 10 days ago • 1 • 2
FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation Paper • 2605.04651 • Published 9 days ago • 1 • 2
SEIF: Self-Evolving Reinforcement Learning for Instruction Following Paper • 2605.07465 • Published 9 days ago • 28 • 2
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States Paper • 2605.07579 • Published 9 days ago • 15 • 3
World Model for Robot Learning: A Comprehensive Survey Paper • 2605.00080 • Published 17 days ago • 16 • 2
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue Paper • 2605.05630 • Published 5 days ago • 11 • 2
Relit-LiVE: Relight Video by Jointly Learning Environment Video Paper • 2605.06658 • Published 10 days ago • 15 • 2
Implicit Preference Alignment for Human Image Animation Paper • 2605.07545 • Published 9 days ago • 1 • 2
Beyond Reasoning: Reinforcement Learning Unlocks Parametric Knowledge in LLMs Paper • 2605.07153 • Published 9 days ago • 6 • 2