Base Model
mistralai/Mistral-Small-3.1-24B-Base-2503
Updated • 5.37k
• 273
Text Generation
• Updated • 429k
• 169
Text Generation
• 22B • Updated • 7.71M
• 4.62k
Text Generation
• 685B • Updated • 3.92M
• 13.3k
Text Generation
• Updated • 9.25k
• 302
baidu/ERNIE-4.5-0.3B-Base-PT
Text Generation
• Updated • 1.54k
• 22
Text Generation
• 1B • Updated • 1.74M
• 2.39k
Updated • 44.1k
• 1.09k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 609k
• 1.51k
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text
• 30B • Updated • 1.53k
• 539
deepseek-ai/DeepSeek-R1-Zero
Text Generation
• 685B • Updated • 7.54k
• 957
Text Generation
• 9B • Updated • 31.1k
• 104
Text Generation
• 0.4B • Updated • 21.7k
• 248
Text Generation
• Updated • 361
• 41
microsoft/Phi-4-mini-flash-reasoning
Text Generation
• Updated • 807
• 275
Qwen/Qwen3-VL-2B-Instruct
Image-Text-to-Text
• 2B • Updated • 125M
• 407
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
• 685B • Updated • 222k
• 994
tencent/Hunyuan-0.5B-Pretrain
Text Generation
• 0.5B • Updated • 1.69k
• 11
Text Generation
• 7B • Updated • 246k
• 69