Running Featured 68 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 68 Who needs 1T parameters? Olympiad proofs with a 4B model
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated about 17 hours ago • 559k • 178
TADA Collection TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | https://huggingface.co/papers/2602.23068 • 5 items • Updated 9 days ago • 67
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 22 days ago • 87