AI & ML interests
Large Multimodal Models
Zhang199/TinyLLaVA-Qwen2-0.5B-SigLIP • Image-Text-to-Text • 1B • Updated • 3.39k • 7
Zhang199/EDGE-GRPO-Qwen-1.5B • Text Generation • 2B • Updated
Zhang199/EDGE-GRPO-Qwen-7B • Text Generation • 8B • Updated • 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-16-512 • Video-Text-to-Text • 4B • Updated • 331 • 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Naive-16-512 • Video-Text-to-Text • 4B • Updated • 24
Zhang199/TinyLLaVA-Video-Phi2-Naive-16-512 • Video-Text-to-Text • 3B • Updated • 6
Zhang199/TinyLLaVA-Qwen2.5-3B-SigLIP • Image-Text-to-Text • 4B • Updated • 897
Zhang199/TinyLLaVA-Video-R1 • Video-Text-to-Text • 4B • Updated • 10 • 4
Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16 • Video-Text-to-Text • 4B • Updated • 33 • 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512 • Video-Text-to-Text • 4B • Updated • 5
Zhang199/subject_bert_mmmu • Text Classification • 0.1B • Updated • 1