Running Featured 71 Distilling 100B+ Models 40x Faster with TRL 📝 71 TRL distillation for 100B+ teachers, 40x faster
Running on CPU Upgrade 225 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 225 Explore synthetic data experiments on a virtual bookshelf
Running on CPU Upgrade Featured 3.12k The Smol Training Playbook 📚 3.12k The secrets to building world-class LLMs
Running 3.8k The Ultra-Scale Playbook 🌌 3.8k The ultimate guide to training LLM on large GPU Clusters
Running on L40S Agents 585 MinerU Document Extraction Tools 📚 585 Easy converting PDF and Office docs into Markdown and JSON
Running 596 Scaling test-time compute 📈 596 Run advanced search strategies to boost LLM problem solving
Running 5 PL-MTEB: Polish Massive Text Embedding Benchmark 📈 5 Display evaluation results in a leaderboard
Running Featured 1.04k Can You Run It? LLM version 🚀 1.04k Calculate GPU needs for running LLMs on your hardware