Mu Cai's picture

Mu Cai

mucai

·

https://pages.cs.wisc.edu/~mucai/

AI & ML interests

Computer Vision, Deep Learning, 3D Vision, Vision and Language,

Recent Activity

upvoted a paper about 1 month ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

submitted a paper about 1 month ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

upvoted a paper 5 months ago

Relational Visual Similarity

View all activity

Organizations

submitted a paper to Daily Papers about 1 month ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Paper • 2603.25744 • Published Mar 26 • 13

authored 2 papers over 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

authored 2 papers almost 2 years ago

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Paper • 2406.20095 • Published Jun 28, 2024 • 18

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 34