to read
GenEx: Generating an Explorable World
Paper
• 2412.09624
• Published
• 98
Image-to-Video
• Updated
• 93
• 610
Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation
Paper
• 2412.06016
• Published
• 20
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published
• 108
Paper
• 2412.15115
• Published
• 377
Alibaba-NLP/gte-multilingual-mlm-base
Fill-Mask
• 0.3B • Updated
• 1.75k
• 15
answerdotai/ModernBERT-large
Fill-Mask
• Updated
• 193k
• 460
Parallelized Autoregressive Visual Generation
Paper
• 2412.15119
• Published
• 53
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Paper
• 2412.15322
• Published
• 20
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
Paper
• 2412.16112
• Published
• 23
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper
• 2501.05441
• Published
• 95
Fill-Mask
• 2B • Updated
• 1.66k
• 66
"Principal Components" Enable A New Language of Images
Paper
• 2503.08685
• Published
• 12
Causal-Copilot: An Autonomous Causal Analysis Agent
Paper
• 2504.13263
• Published
• 7
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Paper
• 2504.17192
• Published
• 123
Vid2World: Crafting Video Diffusion Models to Interactive World Models
Paper
• 2505.14357
• Published
• 27
PixNerd: Pixel Neural Field Diffusion
Paper
• 2507.23268
• Published
• 52