Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

2,399

Full-text search

Active filters: quantized

Makatia/mistral-7b-instruct-v0.2.Q8_0-Q8_0.gguf

7B • Updated 10 days ago • 27

Makatia/microsoft_Phi-3-mini-4k-instruct_onnx_rpi

Updated 10 days ago

Makatia/TinyLlama_TinyLlama-1.1B-Chat-v1.0_onnx

Updated 10 days ago • 1 • 1

JonathanMiddleton/Qwen3-Reranker-4B-GGUF

Text Ranking • 4B • Updated 10 days ago • 81

ramblingpolymath/Qwen3-32B-W8A8

Text Generation • 33B • Updated 9 days ago • 202

steampunque/Deepseek-R1-Distill-Llama-8B-Hybrid-GGUF

8B • Updated 9 days ago • 21

adamrb/mpt-30b-chat-w4a16-gptq

4B • Updated 8 days ago • 8

adamrb/mpt-30b-chat-w8a8-gptq

8B • Updated 8 days ago • 9

tcpipuk/DavidAU-Gemma-3-4b-it-Uncensored-DBL-X-GGUF

Text Generation • 4B • Updated 7 days ago • 288

tachyphylaxis/DeepSeek-R1-0528-FP4

Text Generation • Updated 6 days ago • 64

PJEDeveloper/Mistral_Nemo_Instruct_2407-F16.gguf-Q4_K_M

12B • Updated about 5 hours ago • 36

sdurgi/bert_emotion_response_classifier_quantized

Text Classification • Updated 5 days ago • 5

mirekphd/whisper-large-v3-onnx-fp16

Automatic Speech Recognition • Updated 4 days ago • 1

mirekphd/whisper-large-v3-onnx-w8a16-dynamic

Automatic Speech Recognition • Updated 4 days ago • 1

mirekphd/whisper-large-v3-onnx-w4a16-dynamic

Automatic Speech Recognition • Updated 4 days ago • 1

steampunque/Qwen2.5-VL-32B-Instruct-Hybrid-GGUF

0.7B • Updated 4 days ago • 24

theprint/Zeth-Gemma3-4B-GGUF

Text Generation • 5B • Updated 4 days ago • 91

steampunque/GLM-Z1-9B-0414-Hybrid-GGUF

9B • Updated 2 days ago • 4

YongdongWang/llama-3.2-1b-lora-qlora-dart-llm-gguf

Text Generation • 1B • Updated 2 days ago

YongdongWang/llama-3.1-8b-lora-qlora-dart-llm-gguf

Text Generation • 8B • Updated about 14 hours ago

YongdongWang/llama-3.2-3b-lora-qlora-dart-llm-gguf

Text Generation • 3B • Updated 1 day ago

sugiv/cardvaultplus-500m-gguf

Image-to-Text • 0.4B • Updated 1 day ago

PJEDeveloper/mistralai_Mistral-7B-Instruct-v0.3-F16.gguf-Q5_K_M

7B • Updated 1 day ago

nufikq/tinyllama-php-finetuned-test1

Updated about 20 hours ago

Tohirju/Ameena_Qwen3-8B_e3_Quantised_gguf

8B • Updated about 21 hours ago

Bastion-AI/SmolLM3-3B-GGUF

3B • Updated about 15 hours ago

Danucore/Qwen3-235B-A22B-Instruct-2507-FP4

Text Generation • Updated about 13 hours ago

Danucore/Qwen3-Coder-480B-A35B-Instruct-FP4

Text Generation • Updated about 12 hours ago

PJEDeveloper/mistralai_Mistral-7B-Instruct-v0.2-Q5_K_M

7B • Updated about 5 hours ago