Dr. Zero: Self-Evolving Search Agents without Training Data Paper • 2601.07055 • Published 2 days ago • 8
SketchJudge: A Diagnostic Benchmark for Grading Hand-drawn Diagrams with Multimodal Large Language Models Paper • 2601.06944 • Published 2 days ago • 1
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests Paper • 2601.06953 • Published 2 days ago • 30
IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck Paper • 2601.05870 • Published 4 days ago • 2
Over-Searching in Search-Augmented Large Language Models Paper • 2601.05503 • Published 5 days ago • 5
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published 4 days ago • 20
Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals Paper • 2601.05848 • Published 4 days ago • 13
GenCtrl -- A Formal Controllability Toolkit for Generative Models Paper • 2601.05637 • Published 5 days ago • 2
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control Paper • 2601.05138 • Published 5 days ago • 16
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published 5 days ago • 32
ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing Paper • 2601.03467 • Published 7 days ago • 5
Agentic Rubrics as Contextual Verifiers for SWE Agents Paper • 2601.04171 • Published 6 days ago • 10
Klear: Unified Multi-Task Audio-Video Joint Generation Paper • 2601.04151 • Published 6 days ago • 13
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Paper • 2601.02439 • Published 9 days ago • 15