Running • 3.74k • The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters
meta-llama/Llama-3.1-8B-Instruct • Text Generation • 8B • Updated Sep 25, 2024 • 7.63M • 5.58k