view article Article Fine-tuning Llama 2 70B using PyTorch FSDP +2 smangrul, sgugger, lewtun, philschmid • Sep 13, 2023 • 32
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180
view article Article Introducing RWKV - An RNN with the advantages of a transformer +2 BlinkDL, Hazzzardous, sgugger, ybelkada • May 15, 2023 • 25
view article Article How 🤗 Accelerate runs very large models thanks to PyTorch sgugger • Sep 27, 2022 • 18
view article Article Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate stas, sgugger • Sep 16, 2022 • 1
view article Article Accelerate Large Model Training using DeepSpeed smangrul, sgugger • Jun 28, 2022 • 7
view article Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel smangrul, sgugger • May 2, 2022 • 9