internlm/OREAL-DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
Think Visually, Reason Textually: Vision-Language Synergy in ARC