-
-
-
-
-
-
Inference Providers
Active filters:
awq
tencent/Hunyuan-0.5B-Instruct-AWQ-Int4
Text Generation
•
0.2B
•
Updated
•
60
•
2
twhitworth/gpt-oss-120b-awq-w4a16
117B
•
Updated
•
4.07k
•
8
Valdemardi/DeepSeek-R1-Distill-Qwen-32B-AWQ
Text Generation
•
6B
•
Updated
•
2.72k
•
35
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
Image-Text-to-Text
•
1B
•
Updated
•
458k
•
55
Qwen/Qwen2.5-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
13B
•
Updated
•
47.6k
•
64
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
205k
•
103
Qwen/Qwen3-14B-AWQ
Text Generation
•
3B
•
Updated
•
119k
•
34
bullerwins/Qwen3-30B-A3B-awq
5B
•
Updated
•
5
•
2
cpatonn/Devstral-Small-2507-AWQ-4bit
Text Generation
•
4B
•
Updated
•
6.14k
•
6
btbtyler09/Devstral-Small-2507-AWQ
Text Generation
•
4B
•
Updated
•
300
•
2
TMElyralab/DeepSeek-R1-0528-AWQ-W4AFP8
Text Generation
•
Updated
•
31
•
2
openbmb/MiniCPM-V-4_5-AWQ
Image-Text-to-Text
•
3B
•
Updated
•
2.28k
•
5
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
•
Updated
•
215
•
1
groxaxo/Qwen3-32B-AWorld-W8A16
9B
•
Updated
•
32
•
1
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
16
•
3
casperhansen/falcon-7b-awq
Text Generation
•
Updated
•
8
•
1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
•
Updated
•
6
•
3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
•
Updated
•
9
•
1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation
•
Updated
•
14
casperhansen/opt-125m-awq
Text Generation
•
0.1B
•
Updated
•
1.33k
•
3
casperhansen/tinyllama-1b-awq
Text Generation
•
Updated
•
3.91k
Bomml/Llama-2-70B-chat-w4-g128-awq
Text Generation
•
Updated
TheBloke/Llama-2-7B-Chat-AWQ
Text Generation
•
1B
•
Updated
•
4.45k
•
23
TheBloke/Llama-2-7B-AWQ
Text Generation
•
1B
•
Updated
•
1.56k
•
17
TheBloke/Llama-2-13B-AWQ
Text Generation
•
2B
•
Updated
•
1.25k
•
14
TheBloke/CodeLlama-13B-Python-AWQ
Text Generation
•
2B
•
Updated
•
8
•
2
TheBloke/CodeLlama-13B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
1.35k
•
9
TheBloke/CodeLlama-13B-AWQ
Text Generation
•
2B
•
Updated
•
1.18k
•
4
TheBloke/Llama-2-13B-chat-AWQ
Text Generation
•
2B
•
Updated
•
3.92k
•
26
TheBloke/Llama-2-70B-AWQ
Text Generation
•
10B
•
Updated
•
1.91k
•
14