UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B Image-Text-to-Text β’ 4B β’ Updated Jul 30, 2025 β’ 1.01k β’ 5
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models Paper β’ 2504.11468 β’ Published Apr 10, 2025 β’ 30