Running on CPU Upgrade 233 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 233 Explore synthetic data experiments on a virtual bookshelf
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook 📚 3.18k The secrets to building world-class LLMs
Running 3.85k The Ultra-Scale Playbook 🌌 3.85k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • 71B • Updated Feb 24, 2025 • 145k • • 773
Runtime error Agents Featured 1.01k Model Memory Utility 🚀 1.01k Calculate GPU memory needed for training Hugging Face models