Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

2,399

Full-text search

Active filters: quantized

nvidia/DeepSeek-R1-0528-FP4

Text Generation • Updated Jun 9 • 69.5k • 33

duydq12/nomic-embed-code-FP8-dynamic

Text Generation • 8B • Updated Jun 9 • 22 • 1

QuantStack/Phantom_Wan_14B_FusionX-GGUF

Image-to-Video • 14B • Updated Jun 12 • 6.21k • 28

muranAI/gemma-3n-E4B-it-GGUF

Text Generation • 7B • Updated 25 days ago • 2.05k • 2

mzbac/flux1.kontext.8bit.mlx

Image-to-Image • Updated 18 days ago • 1

ravenscroftj/CodeGen-350M-multi-ggml-quant

Text Generation • Updated Apr 24, 2023 • 2

ravenscroftj/CodeGen-2B-multi-ggml-quant

Text Generation • Updated Aug 5, 2023 • 2

ravenscroftj/CodeGen-6B-multi-ggml-quant

Text Generation • Updated Apr 24, 2023 • 9

ethzanalytics/dolly-v2-12b-sharded-8bit

Text Generation • Updated Apr 29, 2023 • 12 • 4

ethzanalytics/dolly-v2-7b-sharded-8bit

Text Generation • Updated Jun 28, 2023 • 6 • 1

pszemraj/long-t5-tglobal-xl-16384-book-summary-8bit

Summarization • 3B • Updated Jan 21 • 16

ethzanalytics/stablelm-tuned-alpha-7b-sharded-8bit

Text Generation • Updated May 4, 2023 • 9 • 2

rozek/OpenLLaMA_7B_300BT_q4

Text Generation • Updated May 5, 2023 • 1

ethzanalytics/stablelm-tuned-alpha-3b-gptq-4bit-128g

Text Generation • Updated May 7, 2023 • 10

kyo-takano/open-calm-7b-8bit

Text Generation • Updated May 28, 2023 • 7 • 10

CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA

Text Generation • Updated Jul 20, 2023 • 3

CONCISE/LLaMa_V2-13B-Chat-Uncensored-GGML

Text Generation • Updated Aug 7, 2023 • 14 • 7

CONCISE/LLaMa_V2-13B-Instruct-Uncensored-GGML

Text Generation • Updated Aug 17, 2023 • 21 • 5

rozek/LLaMA-2-7B-32K_GGUF

Text Generation • 7B • Updated Aug 31, 2023 • 368 • 9

rozek/LLaMA-2-7B-32K-Instruct_GGUF

Text Generation • 7B • Updated Aug 31, 2023 • 85 • 4

RedHatAI/bge-small-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 119 • 9

RedHatAI/bge-base-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 217 • 4

RedHatAI/bge-large-en-v1.5-quant

Feature Extraction • Updated Nov 13, 2023 • 28 • 22

afrideva/TinyLlama-1.1B-intermediate-step-715k-1.5T-GGUF

1B • Updated Nov 4, 2023 • 60

afrideva/tinyllama-colorist-v2-GGUF

Text Generation • 1B • Updated Nov 4, 2023 • 59

afrideva/stablelm-3b-4e1t-GGUF

Text Generation • 3B • Updated Nov 5, 2023 • 1.41k • 1

afrideva/tiny-llama-miniguanaco-1.5T-GGUF

Text Generation • 1B • Updated Nov 6, 2023 • 73

afrideva/Hermes-Trismegistus-Mistral-7B-GGUF

Text Generation • 7B • Updated Nov 5, 2023 • 64

afrideva/TinyLlama-1.1B-alpaca-chat-v1.5-GGUF

Text Generation • 1B • Updated Nov 6, 2023 • 81 • 2

afrideva/bling-sheared-llama-1.3b-0.1-GGUF

Text Generation • 1B • Updated Nov 6, 2023 • 43 • 1