Models: 103

Active filters: speculative-decoding

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 8, 2025 • 102

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Jan 1 • 168 • 1

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0

0.6B • Updated Aug 10, 2025 • 1 • 1

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 9, 2025 • 19

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 10, 2025 • 23

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Jan 1 • 121

jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 10, 2025 • 68

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0

0.7B • Updated Aug 11, 2025 • 3 • 1

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0-GGUF

0.7B • Updated Aug 11, 2025 • 29

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated Aug 12, 2025 • 62

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated Dec 9, 2025 • 105

nm-testing/llama4-scout-17b-eagle3-dummy-drafter

Updated Aug 27, 2025

RedHatAI/Llama-4-Maverick-17B-128E-Instruct-speculator.eagle3

Updated Dec 2, 2025 • 7

HathoraResearch/qwen3_30b_moe_eagle3-ultra-1k-sample

0.2B • Updated Sep 10, 2025 • 3 • 2

husj576/GTO-llama31-instruct-8B

Text Generation • Updated 7 days ago • 27

nm-testing/Llama4-Maverick-Eagle3-Speculators-64k-vocab

Updated Oct 22, 2025

jukofyork/Mistral-Large-Instruct-2411-DRAFT-0.4B-v3.0

0.4B • Updated Oct 28, 2025 • 1

jukofyork/Mistral-Large-Instruct-2411-DRAFT-0.4B-v3.0-GGUF

0.4B • Updated Oct 28, 2025 • 47

mradermacher/Mistral-Large-Instruct-2411-DRAFT-0.4B-v3.0-GGUF

0.4B • Updated Oct 29, 2025 • 67

jukofyork/command-a-03-2025-DRAFT-0.8B-v3.0

0.8B • Updated Oct 29, 2025 • 1 • 1

jukofyork/command-a-03-2025-DRAFT-0.8B-v3.0-GGUF

0.8B • Updated Oct 29, 2025 • 46

taobao-mnn/Qwen3-4B-Instruct-2507-Eagle3

Text Generation • 0.2B • Updated Oct 30, 2025 • 123 • 1

taobao-mnn/Qwen3-VL-2B-Instruct-Eagle3

Text Generation • 0.1B • Updated Oct 31, 2025 • 97 • 6

mradermacher/command-a-03-2025-DRAFT-0.8B-v3.0-GGUF

0.8B • Updated Oct 30, 2025 • 75

mradermacher/command-a-03-2025-DRAFT-0.8B-v3.0-i1-GGUF

0.8B • Updated Dec 10, 2025 • 235

taobao-mnn/Qwen3-VL-4B-Instruct-Eagle3

Text Generation • 0.2B • Updated Nov 3, 2025 • 126

taobao-mnn/Qwen3-VL-2B-Thinking-Eagle3

Text Generation • 0.1B • Updated Nov 3, 2025 • 66

taobao-mnn/Qwen3-VL-4B-Thinking-Eagle3

Text Generation • 0.2B • Updated Nov 10, 2025 • 103 • 1

taobao-mnn/Qwen3-4B-Thinking-2507-Eagle

Text Generation • 0.2B • Updated Nov 10, 2025 • 108 • 1

Zjcxy-SmartAI/Eagle3-Qwen3-8B-zh

Text Generation • Updated Dec 15, 2025 • 28 • 3