PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference
Paper • 2603.25730 • Published • 45
Computer Vision
RIVER: A Real-Time Interaction Benchmark for Video LLMs
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision