Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Inference Optimization
community
Activity Feed
Follow
20
AI & ML interests
None defined yet.
Recent Activity
krishnateja95
updated
a collection
1 day ago
HIGGS
krishnateja95
updated
a collection
1 day ago
HIGGS
krishnateja95
updated
a collection
1 day ago
HIGGS
View all activity
Team members
15
inference-optimization
's models
233
Sort: Recently updated
inference-optimization/DeepSeek-V3-debug-multiply-NVFP4A16
0.9B
•
Updated
Jan 23
inference-optimization/DeepSeek-V3-debug-add-NVFP4A16
0.9B
•
Updated
Jan 23
•
5
inference-optimization/DeepSeek-V3-debug-empty-NVFP4A16
0.9B
•
Updated
Jan 23
•
107
inference-optimization/DeepSeek-V3-debug-add
1B
•
Updated
Jan 23
•
5
inference-optimization/DeepSeek-V3-debug-multiply
1B
•
Updated
Jan 23
•
12
inference-optimization/Qwen3-0.6B-debug-add-FP8_BLOCK
0.6B
•
Updated
Jan 23
inference-optimization/Qwen3-0.6B-debug-multiply-FP8_BLOCK
0.6B
•
Updated
Jan 23
inference-optimization/Qwen3-0.6B-FP8_BLOCK
0.6B
•
Updated
Jan 23
•
56
inference-optimization/Qwen3-0.6B-debug-add-W4A16-G128
0.2B
•
Updated
Jan 23
inference-optimization/Qwen3-0.6B-debug-multiply-W4A16-G128
0.2B
•
Updated
Jan 23
inference-optimization/Qwen3-0.6B-W4A16-G128
0.2B
•
Updated
Jan 23
•
97
inference-optimization/Qwen3-0.6B-debug-add
0.6B
•
Updated
Jan 23
inference-optimization/Qwen3-0.6B-debug-multiply
0.6B
•
Updated
Jan 23
•
3
inference-optimization/DeepSeek-V3-debug-empty
1B
•
Updated
Jan 23
•
211
inference-optimization/granite-4.0-h-tiny-FP8-block
Text Generation
•
7B
•
Updated
Jan 23
•
4
inference-optimization/granite-4.0-h-tiny-quantized.w8a8
7B
•
Updated
Jan 23
•
1
inference-optimization/granite-4.0-h-tiny-NVFP4
Updated
Jan 22
inference-optimization/granite-4.0-h-tiny-quantized.w4a16
Updated
Jan 22
inference-optimization/granite-4.0-h-small-quantized.w8a8
Updated
Jan 19
inference-optimization/granite-4.0-h-small-NVFP4
Updated
Jan 19
inference-optimization/granite-4.0-h-small-quantized.w4a16
Updated
Jan 19
inference-optimization/granite-4.0-h-small-FP8-dynamic
Updated
Jan 19
inference-optimization/granite-4.0-h-small-FP8-block
Updated
Jan 19
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
18B
•
Updated
Jan 15
•
4
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
•
32B
•
Updated
Jan 9
•
3
inference-optimization/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation
•
81B
•
Updated
Jan 9
•
11
inference-optimization/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation
•
81B
•
Updated
Jan 9
•
12
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-quantized.w4a16
6B
•
Updated
Jan 7
•
5
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8-dynamic
32B
•
Updated
Jan 6
•
1
inference-optimization/Qwen3-Next-80B-A3B-Thinking-quantized.w8a8
Updated
Dec 24, 2025
Previous
1
...
5
6
7
8
Next