VidEoMT: Your ViT is Secretly Also a Video Segmentation Model Paper • 2602.17807 • Published 4 days ago • 3
view article Article How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism 12 days ago • 15
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 87
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 65
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 301