Inference Optimization
Recent Activity
ChibuUkachi updated a model 2 days ago: inference-optimization/Qwen3-235B-A22B-Thinking-2507.w8a8
ChibuUkachi updated a model 2 days ago: inference-optimization/Qwen3-235B-A22B-Thinking-2507.w4a16
ChibuUkachi updated a model 4 days ago: inference-optimization/final-ctest-Qwen3-8B-speculator.dflash
Team members: 17
inference-optimization's models: 336
inference-optimization/Qwen3-8B_6_bits_mode_noise • 7B • Updated Mar 12 • 2
inference-optimization/Qwen3-8B_6_bits_mode_hybrid • 7B • Updated Mar 12 • 5
inference-optimization/Qwen3-8B_5.5_bits_mode_heuristic • 6B • Updated Mar 12 • 1
inference-optimization/Qwen3-8B_5.5_bits_mode_noise • 6B • Updated Mar 12
inference-optimization/Qwen3-8B_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 1
inference-optimization/Qwen3-8B_5_bits_mode_heuristic • 6B • Updated Mar 12 • 1
inference-optimization/Qwen3-8B_5_bits_mode_noise • 6B • Updated Mar 12
inference-optimization/Qwen3-8B_5_bits_mode_hybrid • 6B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_heuristic • 7B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_noise • 7B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_hybrid • 7B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_heuristic • 7B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_noise • 7B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_hybrid • 7B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_heuristic • 6B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_noise • 6B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_hybrid • 6B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_heuristic • 6B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_noise • 6B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_hybrid • 6B • Updated Mar 12 • 1
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_heuristic • 6B • Updated Mar 12 • 2
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_noise • 6B • Updated Mar 12
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_hybrid • 6B • Updated Mar 12
inference-optimization/sarvam-105b-FP8-Dynamic • Text Generation • 106B • Updated Mar 9 • 11
inference-optimization/sarvam-30b-FP8-Dynamic • Text Generation • 32B • Updated Mar 9 • 29 • 1
inference-optimization/sarvam-30b-NVFP4 • Text Generation • 19B • Updated Mar 9 • 10 • 1
inference-optimization/sarvam-105b-NVFP4 • 61B • Updated Mar 9 • 3 • 1
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic • 35B • Updated Mar 6 • 2
inference-optimization/gpt-oss-20b-FP8-Dynamic • 21B • Updated Mar 5 • 9 • 1
inference-optimization/Qwen3-30B-A3B-Instruct-2507-NVFP4 • 17B • Updated Mar 4 • 46