Meta VL-JEPA - Vision-Language Prediction Models Collection Meta VL-JEPA Vision-Language Joint Embedding Predictive Architecture for video understanding • 6 items • Updated Jan 16 • 8