Sports Video Understanding Benchmarks
AI & ML interests
Computer Vision; Video Understanding; Action Recognition
Recent Activity
Papers
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
SAM 2++: Tracking Anything at Any Granularity
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 1 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 49.3k • 50 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 22.8k • 45 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 2.11k • 7
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 330 • 68 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 42 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 565 • 18 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 1.12k • 22
Sports Video Understanding Benchmarks
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 1 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 49.3k • 50 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 22.8k • 45 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 2.11k • 7
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 330 • 68 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 42 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 565 • 18 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 1.12k • 22
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).