Running 7 physics-intern: an Autonomous Agent for Physics Research 📝 7 Generate autonomous research reports for physics problems
Running 143 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 143 Building and scaling RL environments for LLM training
Running 18 Defeating the trainer-generator precision mismatch in TRL 🎯 18 Download research PDF (Pro access required)
Running Featured 80 Distilling 100B+ Models 40x Faster with TRL 📝 80 TRL distillation for 100B+ teachers, 40x faster