Repurposing Geometric Foundation Models for Multi-view Diffusion Paper • 2603.22275 • Published 3 days ago • 40
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 4 days ago • 41
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 4 days ago • 41
RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models Paper • 2603.21341 • Published 4 days ago • 23
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 4 days ago • 41
3DRS Collection Checkpoints of 3DRS (Huang et al., NeurIPS 25') with Qwen3-VL • 2 items • Updated 24 days ago
VaLR Collection Checkpoints of VaLR (Jeon et al., arXiv 26') and its variants • 5 items • Updated 24 days ago
VG-LLM Collection Checkpoints of VG-LLM (Zheng et al., NeurIPS 25') with Qwen3-VL • 2 items • Updated 24 days ago
VaLR Collection Checkpoints of VaLR (Jeon et al., arXiv 26') and its variants • 5 items • Updated 24 days ago