view post Post 2209 We just released our latest Shisa V2.1 Japanese multi-lingual models: https://huggingface.co/collections/shisa-ai/shisa-v21Besides updates to our 14B, and 70B, we have a new LFM2-based 1.2B, Llama 3.2-based 3B, and Qwen 3-based 8B, all with class-leading Japanese language capabilities.Per usual, lots of details in the Model Cards for those interested. See translation 1 reply · 🔥 5 5 + Reply
Running on CPU Upgrade Featured 2.92k The Smol Training Playbook 📚 2.92k The secrets to building world-class LLMs
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 507
🎯 Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated 13 days ago • 107
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 204k • 1.1k
Qwen/Qwen3-Next-80B-A3B-Instruct Text Generation • 81B • Updated Sep 17, 2025 • 1.63M • • 911