Models: 7,394

Active filters: awq

tencent/Hunyuan-0.5B-Instruct-AWQ-Int4

Text Generation • 0.2B • Updated 6 days ago • 60 • 2

twhitworth/gpt-oss-120b-awq-w4a16

117B • Updated 19 days ago • 4.07k • 8

Valdemardi/DeepSeek-R1-Distill-Qwen-32B-AWQ

Text Generation • 6B • Updated Jan 20 • 2.72k • 35

Qwen/Qwen2.5-VL-3B-Instruct-AWQ

Image-Text-to-Text • 1B • Updated Apr 6 • 458k • 55

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

Image-Text-to-Text • 13B • Updated Mar 7 • 47.6k • 64

Qwen/Qwen3-32B-AWQ

Text Generation • 6B • Updated May 21 • 205k • 103

Qwen/Qwen3-14B-AWQ

Text Generation • 3B • Updated May 21 • 119k • 34

bullerwins/Qwen3-30B-A3B-awq

5B • Updated May 3 • 5 • 2

cpatonn/Devstral-Small-2507-AWQ-4bit

Text Generation • 4B • Updated 30 days ago • 6.14k • 6

btbtyler09/Devstral-Small-2507-AWQ

Text Generation • 4B • Updated Jul 31 • 300 • 2

TMElyralab/DeepSeek-R1-0528-AWQ-W4AFP8

Text Generation • Updated 10 days ago • 31 • 2

openbmb/MiniCPM-V-4_5-AWQ

Image-Text-to-Text • 3B • Updated 5 days ago • 2.28k • 5

QuantTrio/DeepSeek-V3.1-AWQ-Lite

Text Generation • Updated 2 days ago • 215 • 1

groxaxo/Qwen3-32B-AWorld-W8A16

9B • Updated 4 days ago • 32 • 1

casperhansen/mpt-7b-8k-chat-awq

Text Generation • Updated Nov 4, 2023 • 16 • 3

casperhansen/falcon-7b-awq

Text Generation • Updated Nov 4, 2023 • 8 • 1

casperhansen/vicuna-7b-v1.5-awq

Text Generation • Updated Oct 31, 2023 • 6 • 3

casperhansen/vicuna-7b-v1.5-awq-gemv

Text Generation • Updated Oct 31, 2023 • 9 • 1

casperhansen/mpt-7b-8k-chat-awq-gemv

Text Generation • Updated Oct 31, 2023 • 14

casperhansen/opt-125m-awq

Text Generation • 0.1B • Updated Oct 31, 2023 • 1.33k • 3

casperhansen/tinyllama-1b-awq

Text Generation • Updated Oct 31, 2023 • 3.91k

Bomml/Llama-2-70B-chat-w4-g128-awq

Text Generation • Updated Sep 16, 2023

TheBloke/Llama-2-7B-Chat-AWQ

Text Generation • 1B • Updated Nov 9, 2023 • 4.45k • 23

TheBloke/Llama-2-7B-AWQ

Text Generation • 1B • Updated Nov 9, 2023 • 1.56k • 17

TheBloke/Llama-2-13B-AWQ

Text Generation • 2B • Updated Nov 9, 2023 • 1.25k • 14

TheBloke/CodeLlama-13B-Python-AWQ

Text Generation • 2B • Updated Nov 9, 2023 • 8 • 2

TheBloke/CodeLlama-13B-Instruct-AWQ

Text Generation • 2B • Updated Nov 9, 2023 • 1.35k • 9

TheBloke/CodeLlama-13B-AWQ

Text Generation • 2B • Updated Nov 9, 2023 • 1.18k • 4

TheBloke/Llama-2-13B-chat-AWQ

Text Generation • 2B • Updated Nov 9, 2023 • 3.92k • 26

TheBloke/Llama-2-70B-AWQ

Text Generation • 10B • Updated Nov 9, 2023 • 1.91k • 14