Taming generative video models for zero-shot optical flow extraction Paper • 2507.09082 • Published Jul 11, 2025 • 13
CaptionQA: Is Your Caption as Useful as the Image Itself? Paper • 2511.21025 • Published Nov 26, 2025 • 28
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos Paper • 2411.11409 • Published Nov 18, 2024