The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 3.17k
Article • SmolLM3: smol, multilingual, long-context reasoner • eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 775
Qwen/Qwen3-Embedding-0.6B Feature Extraction • 0.6B • Updated 26 days ago • 6.03M • 1.02k
bigcode/self-oss-instruct-sc2-exec-filter-50k • Updated Nov 4, 2024 • 50.7k • 8.13k • 106
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation • 33B • Updated Jan 12, 2025 • 941k • 2.02k
Qwen/Qwen2.5-Coder-14B-Instruct Text Generation • 15B • Updated Jan 12, 2025 • 1.33M • 155