VLAA-Thinker - a UCSC-VLAA Collection

UCSC-VLAA 's Collections

GPT-Image-Edit-1.5M

m1

CLIPS

CLIPA

Recap-DataComp-1B

HQ-Edit

VLAA-Thinker

updated Sep 3, 2025

UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B

Image-Text-to-Text • 4B • Updated Jul 30, 2025 • 1.01k • 5
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B

Image-Text-to-Text • 8B • Updated Jul 30, 2025 • 288 • 2
UCSC-VLAA/VLAA-Thinker-Qwen2VL-2B

Image-Text-to-Text • 2B • Updated Jul 30, 2025 • 9 • 1
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B

Image-Text-to-Text • 8B • Updated Jul 30, 2025 • 11
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B-Zero

Image-Text-to-Text • 8B • Updated Jul 30, 2025 • 8
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10, 2025 • 30
UCSC-VLAA/VLAA-Thinking

Viewer • Updated Sep 27, 2025 • 2 • 885 • 20