Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks Paper • 2602.01630 • Published 4 days ago • 46
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation Paper • 2602.02214 • Published 3 days ago • 23
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published 3 days ago • 29
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 6 days ago • 79
Show, Don't Tell: Morphing Latent Reasoning into Image Generation Paper • 2602.02227 • Published 3 days ago • 10
Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published 15 days ago • 42
Future Optical Flow Prediction Improves Robot Control & Video Generation Paper • 2601.10781 • Published 21 days ago • 19
Inference-time Physics Alignment of Video Generative Models with Latent World Models Paper • 2601.10553 • Published 21 days ago • 12
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 21 days ago • 28
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation Paper • 2601.10061 • Published 22 days ago • 30
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands Paper • 2512.24965 • Published Dec 31, 2025 • 42
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published 27 days ago • 23
AgentOCR: Reimagining Agent History via Optical Self-Compression Paper • 2601.04786 • Published 28 days ago • 29
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 27 days ago • 51
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 28 days ago • 166