Urban Socio-Semantic Segmentation with Vision-Language Reasoning • Paper • 2601.10477 • Published 2 days ago • 143
Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training • Paper • 2601.03256 • Published 11 days ago • 6
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling • Paper • 2601.02346 • Published 12 days ago • 25
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation • Paper • 2601.00664 • Published 15 days ago • 52
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion • Paper • 2512.17504 • Published 29 days ago • 96
CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives • Paper • 2512.14696 • Published Dec 16, 2025 • 7
Structured 3D Latents for Scalable and Versatile 3D Generation • Paper • 2412.01506 • Published Dec 2, 2024 • 86
TRELLISWorld: Training-Free World Generation from Object Generators • Paper • 2510.23880 • Published Oct 27, 2025 • 2
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance • Paper • 2512.08765 • Published Dec 9, 2025 • 130
TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels • Paper • 2512.08358 • Published Dec 9, 2025 • 5
LATTICE: Democratize High-Fidelity 3D Generation at Scale • Paper • 2512.03052 • Published Nov 24, 2025 • 10
NaTex: Seamless Texture Generation as Latent Color Diffusion • Paper • 2511.16317 • Published Nov 20, 2025 • 15
Depth Anything 3: Recovering the Visual Space from Any Views • Paper • 2511.10647 • Published Nov 13, 2025 • 96
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds • Paper • 2511.08892 • Published Nov 12, 2025 • 206