ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining Paper • 2505.19893 • Published May 26, 2025
Adversarial Training for Defense Against Label Poisoning Attacks Paper • 2502.17121 • Published Feb 24, 2025
Running 3.67k The Ultra-Scale Playbook 🌌 3.67k The ultimate guide to training LLM on large GPU Clusters
Optimistic Games for Combinatorial Bayesian Optimization with Application to Protein Design Paper • 2409.18582 • Published Sep 27, 2024