SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 3 days ago • 57
MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI Paper • 2605.08678 • Published 8 days ago • 8
STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation Paper • 2605.08029 • Published 9 days ago • 10
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 13 days ago • 326
Nano-World-Model Collection 🌍 A minimalist repository for training video world models based on diffusion-forcing. • 18 items • Updated 11 days ago • 5
Nano-World-Model Collection 🌍 A minimalist repository for training video world models based on diffusion-forcing. • 18 items • Updated 11 days ago • 5
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 20 days ago • 70
Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising Paper • 2604.26694 • Published 18 days ago • 6