MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech
Wang Chengyao PRO
wcy1122
AI & ML interests
Multimodal Intelligence
Recent Activity
upvoted a paper 4 days ago
VP-VLA: Visual Prompting as an Interface for Vision-Language-Action Models
upvoted a paper 9 days ago
Efficient Reasoning with Balanced Thinking
upvoted a paper 25 days ago
Utonia: Toward One Encoder for All Point Clouds