view article Article Making LLMs lighter with AutoGPTQ and transformers +4 marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke • Aug 23, 2023 • 64
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180
view article Article Overview of natively supported quantization schemes in 🤗 Transformers +3 ybelkada, marcsun13, IlyasMoutawwakil, clefourrier, fxmarty • Sep 12, 2023 • 13
PEFT papers Collection A collection of methods that have been implemented in the 🤗 PEFT library • 12 items • Updated Jan 30, 2024 • 32
A little guide to building Large Language Models in 2024 Collection Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757 • 17 items • Updated Mar 2 • 17
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes ybelkada, timdettmers • Aug 17, 2022 • 132