Models: 447

Active filters: quantization

aghatage/SFR-Embedding-2_R-4bit-NF4

Feature Extraction • 7B • Updated Sep 23 • 3

ShahzebKhoso/Qwen3Guard-Gen-8B-GGUF

8B • Updated Sep 24 • 158 • 1

Sunbird/Sunflower-14B-FP8

Text Generation • 15B • Updated 17 days ago • 16

Sunbird/Sunflower-14B-FP4A16

Text Generation • 9B • Updated 17 days ago

Sunbird/Sunflower-32B-FP8

Text Generation • 33B • Updated 17 days ago • 21

Sunbird/Sunflower-32B-FP4A16

Text Generation • 19B • Updated 17 days ago • 1

Lerelou/Brains4b.q4_k_m-GGUF

4B • Updated 5 days ago • 45 • 1

SandLogicTechnologies/Qwen3-4B-Thinking-2507-GGUF

Text Generation • 4B • Updated 28 days ago • 10

ShahzebKhoso/Qwen3-4B-SafeRL-GGUF

4B • Updated 26 days ago • 155

Sunbird/Sunflower-32B-W8A8

Text Generation • 33B • Updated 17 days ago • 38

Sunbird/Sunflower-14B-W8A8

Text Generation • 15B • Updated 17 days ago • 12

softjapan/softjapan-model-gguf

Text Generation • 3B • Updated 23 days ago • 39

neonconverse/gemma-3-27b-abliterated-awq-4bit

Updated 20 days ago

Dhana8907/Llama-3.1-8B-Instruct-4bit

Text Generation • 8B • Updated 17 days ago • 52

ranjan56cse/gpt2-large-agnews-quantization-bitsandbytes

Text Classification • Updated 18 days ago

Varadrajan/llama-3.1-8b-alpaca-finetuned_8bit_gguf

Text Generation • 8B • Updated 18 days ago • 55

YuvrajSingh9886/facebook-opt-350m-8bit-bnb

Text Generation • 0.3B • Updated 14 days ago • 21

Bellesteck/Apriel-1.5-15b-Thinker-FP8-W8A8

Image-Text-to-Text • 14B • Updated 13 days ago • 52

Ram07/bitskip-v1-earlyexit

Text Generation • 1.0B • Updated 12 days ago • 7

Ram07/bitskip-v2-earlyexit

Text Generation • 1.0B • Updated 12 days ago • 11

Ram07/bitskip-v3-earlyexit

Text Generation • 1.0B • Updated 12 days ago • 14

Ram07/llama3-earlyexit

Text Generation • 1.0B • Updated 12 days ago • 13

raining-codes/Gemma3-1B-LOMO-q4f16_1-MLC

Text Generation • Updated 8 days ago • 60

raining-codes/Gemma3-270M-LOMO-q4f16_1-MLC

Text Generation • Updated 8 days ago • 79

raining-codes/Qwen3-0.6B-LOMO-q4f16_1-MLC

Text Generation • Updated 8 days ago • 67

raining-codes/Qwen3-1.7B-LOMO-q4f16_1-MLC

Text Generation • Updated 7 days ago • 91

raining-codes/Qwen3-1.7B-LOMO-q4f16_1-MLC2

Text Generation • Updated 6 days ago • 14