physics-intern: an Autonomous Agent for Physics Research. Generate autonomous research reports for physics problems.
The ultimate guide to RL environments: building and scaling them in the LLM era. Building and scaling RL environments for LLM training.
Defeating the trainer-generator precision mismatch in TRL. Download research PDF (Pro access required).
Distilling 100B+ Models 40x Faster with TRL. TRL distillation for 100B+ teachers, 40x faster.