view article Article Training Design for Text-to-Image Models: Lessons from Ablations 3 days ago β’ 47
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 10 days ago β’ 48
Running on CPU Upgrade Featured 2.96k The Smol Training Playbook π 2.96k The secrets to building world-class LLMs
π¨βπ³Cooking with HF: HPO with Transformers and Optuna Collection My HF recipe space and model. β’ 2 items β’ Updated Dec 10, 2025