nyu-visionx/Cambrian-S-3M
Updated
•
736
•
2
None defined yet.
SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding
Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts