Awesome tiny LLMs
mistralai/Mixtral-8x7B-Instruct-v0.1
47B • Updated
• 782k
• 4.64k
berkeley-nest/Starling-LM-7B-alpha
Text Generation
• 7B • Updated
• 3.95k
• 559
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation
• Updated
• 28.2k
• 648
TomGrc/FusionNet_7Bx2_MoE_14B
Text Generation
• 13B • Updated
• 760
• 36
mlabonne/NeuralMarcoro14-7B
Text Generation
• 7B • Updated
• 1.34k
• 39
Text Generation
• 1B • Updated
• 20
• 17
Text Generation
• 7B • Updated
• 2.83k
• 48
teknium/OpenHermes-2.5-Mistral-7B
Text Generation
• Updated
• 150k
• 886
alignment-handbook/zephyr-7b-sft-full
Text Generation
• 7B • Updated
• 4.73k
• 27
openbmb/MiniCPM-2B-sft-bf16
Text Generation
• Updated
• 28.5k
• 123
Text Generation
• 3B • Updated
• 172k
• 1.15k
state-spaces/mamba-2.8b-hf
Text Generation
• 3B • Updated
• 6.7k
• 111
ridger/SpikeGPT-BookCorpus
Text Generation
• Updated
• 21
ridger/SpikeGPT-OpenWebText-216M
Updated
• 14
togethercomputer/StripedHyena-Nous-7B
Text Generation
• 8B • Updated
• 96
• 143
Text Generation
• 9B • Updated
• 2.47k
• 252
Text Generation
• 7B • Updated
• 387
• 58
Text Generation
• Updated
• 4.06k
• 45
Qwen/Qwen1.5-MoE-A2.7B-Chat
Text Generation
• Updated
• 32.4k
• 132
mistralai/Mistral-7B-Instruct-v0.3
7B • Updated
• 1.59M
• 2.45k
microsoft/Phi-3-medium-4k-instruct
Text Generation
• 14B • Updated
• 11.1k
• 224
microsoft/Phi-3-small-8k-instruct
Text Generation
• 7B • Updated
• 17.2k
• 175
Text Generation
• 0.4B • Updated
• 2.03k
• 19
mistralai/Mamba-Codestral-7B-v0.1
7B • Updated
• 29.2k
• 612
meta-llama/Llama-3.1-8B-Instruct
Text Generation
• Updated
• 7.28M
• 5.53k
Text Generation
• Updated
• 412k
• 1.3k
tiiuae/falcon-mamba-7b-instruct
Text Generation
• 7B • Updated
• 6.39k
• 70
Updated
• 3
• 4
goombalab/Hybrid-Phi-Mamba
Updated
• 1
• 4
nvidia/mamba2-hybrid-8b-3t-4k
Text Generation
• Updated
• 74
Text Generation
• 8B • Updated
• 20.9M
• 1.11k
stabilityai/stablelm-2-1_6b-chat
Text Generation
• 2B • Updated
• 1.04k
• 34
state-spaces/mamba2attn-2.7b
Updated
• 71
• 8
Updated
• 1.17k
• 19
Text Generation
• 1B • Updated
• 1.49M
• 2.31k
Text Generation
• 2B • Updated
• 366
• 157
tiiuae/Falcon3-Mamba-7B-Base
Text Generation
• 7B • Updated
• 275
• 23
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
• 8B • Updated
• 719k
• 799
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• Updated
• 1.47M
• 1.46k
microsoft/Phi-4-mini-instruct
Text Generation
• Updated
• 269k
• 695
EpistemeAI/DeepThinkers-Phi4
Text Generation
• 15B • Updated
• 5
mradermacher/phi-4-abliterated-i1-GGUF
15B • Updated
• 385
• 7
prithivMLmods/Phi-4-Super-1
Text Generation
• 15B • Updated
• 16
• 8
microsoft/bitnet-b1.58-2B-4T-gguf
Text Generation
• 2B • Updated
• 18.6k
• 239
Text Generation
• Updated
• 5.51M
• 564
google/gemma-3n-E4B-it-litert-preview
Image-Text-to-Text
• Updated
• 1.48k
Text Generation
• 0.8B • Updated
• 11.8M
• 1.12k
Text Generation
• Updated
• 116k
• 903
Text Generation
• Updated
• 455k
• 352
microsoft/Phi-4-mini-flash-reasoning
Text Generation
• Updated
• 27.2k
• 269
Text Generation
• 0.2B • Updated
• 34
• 2
Text Generation
• Updated
• 135k
• 992
Text Generation
• Updated
• 114k
• 562
Image-Text-to-Text
• Updated
• 64.5k
• 1.07k
Translation
• 8B • Updated
• 9.99k
• 551
swiss-ai/Apertus-8B-Instruct-2509
Text Generation
• Updated
• 141k
• 437
Text Generation
• Updated
• 33.7k
• 384
ffurfaro/Titans-Qwen2.5-1.5B
Text Generation
• Updated
• 1
facebook/MobileLLM-R1-950M
Text Generation
• Updated
• 1.23k
• 280
xTimeCrystal/MiniModel-200M-Base
Text Generation
• Updated
• 16
• 30
Text Generation
• 3B • Updated
• 25.9k
• 180
Text Generation
• 8B • Updated
• 48.9k
• 331
unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF
Text Generation
• 80B • Updated
• 40.4k
• 166
mradermacher/Huihui-GLM-4.7-Flash-abliterated-i1-GGUF
30B • Updated
• 5.24k
• 9
unsloth/GLM-4.7-Flash-REAP-23B-A3B-GGUF
Text Generation
• 23B • Updated
• 47k
• 171
MuXodious/GLM-4.7-Flash-REAP-23B-A3B-absolute-heresy-GGUF
Text Generation
• 23B • Updated
• 3.29k
• 10
cerebras/Kimi-Linear-REAP-35B-A3B-Instruct
Text Generation
• 35B • Updated
• 98
• 67
moonshotai/Kimi-Linear-48B-A3B-Instruct
Text Generation
• 49B • Updated
• 37.3k
• 547
tiiuae/Falcon-H1-Tiny-R-90M
Text Generation
• 91.1M • Updated
• 487
• 25
inclusionAI/LLaDA2.1-mini
Text Generation
• 16B • Updated
• 26.3k
• 98
unsloth/Qwen3.5-35B-A3B-GGUF
Image-Text-to-Text
• 35B • Updated
• 919k
• 547
mradermacher/Qwen3.5-35B-A3B-GGUF
35B • Updated
• 2.4k
• 1
mradermacher/Qwen3.5-27B-heretic-i1-GGUF
27B • Updated
• 11.7k
• 7
mradermacher/Qwen3.5-9B-heretic-GGUF
9B • Updated
• 25.8k
• 9