Active filters: 4-bit
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit • Image-Text-to-Text • 53.4k • 73
FunAGI/Qwen2.5-Omni-7B-GPTQ-4bit • Any-to-Any • 2.59k • 39
Qwen/Qwen2.5-VL-32B-Instruct-AWQ • Image-Text-to-Text • 27.7k • 32
mlx-community/deepcogito-cogito-v1-preview-qwen-32B-4bit • Text Generation • 93 • 5
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit • Text Generation • 428k • 68
Qwen/Qwen2.5-VL-7B-Instruct-AWQ • Image-Text-to-Text • 88.1k • 51
Qwen/QwQ-32B-AWQ • Text Generation • 166k • 111
mlx-community/Kimi-VL-A3B-Thinking-4bit • Text Generation • 66 • 4
MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF • Text Generation • 767k • 44
John6666/Llama-3.1-8B-Lexi-Uncensored-V2-nf4 • Text Generation • 8.52k • 17
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-GPTQ-4bit • 36 • 5
unsloth/gemma-3-12b-it-unsloth-bnb-4bit • Image-Text-to-Text • 105k • 8
gaunernst/gemma-3-27b-it-int4-awq • Image-Text-to-Text • 12.2k • 15
stelterlab/openhands-lm-32b-v0.1-AWQ • Text Generation • 2.29k • 7
axolotl-quants/Llama-4-Scout-17B-16E-Linearized-bnb-nf4-bf16 • Image-Text-to-Text • 3.89k • 3
TheBloke/MythoMax-L2-13B-GPTQ • Text Generation • 4.14k • 201
lllyasviel/omost-llama-3-8b-4bits • Text Generation • 14.4k • 21
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF • Text Generation • 766k • 19
Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4 • Text Generation • 14.4k • 19
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int4 • Text Generation • 71.3k • 34
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int4 • Text Generation • 14.3k • 37
Qwen/Qwen2.5-14B-Instruct-AWQ • Text Generation • 66.3k • 22
Qwen/Qwen2.5-32B-Instruct-AWQ • Text Generation • 70.3k • 70
Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int4 • Text Generation • 136k • 16
mlx-community/DeepSeek-R1-Distill-Qwen-14B-4bit
Valdemardi/DeepSeek-R1-Distill-Qwen-32B-AWQ • Text Generation • 13.3k • 28
mlx-community/DeepSeek-R1-4bit • 1.91k • 31
mlx-community/DeepSeek-R1-Distill-Llama-70B-4bit • Text Generation • 466 • 8
graelo/Qwen2.5-7B-Instruct-1M-AWQ
unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit • Image-Text-to-Text • 33.8k • 28