Linear representations in language models can change dramatically over a conversation Paper • 2601.20834 • Published 14 days ago • 21
Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts Paper • 2601.22156 • Published 13 days ago • 10
KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices Paper • 2601.21579 • Published 13 days ago • 6
DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents Paper • 2601.20975 • Published 14 days ago • 9
Reinforcement World Model Learning for LLM-based Agents Paper • 2602.05842 • Published 6 days ago • 22
AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents Paper • 2602.06855 • Published 5 days ago • 64