-
-
-
-
-
-
Inference Providers
Active filters:
4-bit
mlx-community/Kimi-Dev-72B-4bit-DWQ
Text Generation
•
73B
•
Updated
•
1.68k
•
15
LGAI-EXAONE/EXAONE-4.0-32B-AWQ
Text Generation
•
5B
•
Updated
•
580
•
9
mlx-community/Devstral-Small-2507-4bit-DWQ
Text Generation
•
24B
•
Updated
•
282
•
8
LGAI-EXAONE/EXAONE-4.0-32B-GPTQ
Text Generation
•
5B
•
Updated
•
136
•
7
LGAI-EXAONE/EXAONE-4.0-1.2B-AWQ
Text Generation
•
0.4B
•
Updated
•
100
•
7
mlx-community/Kimi-K2-Instruct-4bit
Text Generation
•
1T
•
Updated
•
3.38k
•
7
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
453k
•
84
Qwen/Qwen-7B-Chat-Int4
Text Generation
•
2B
•
Updated
•
2.33k
•
74
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
2B
•
Updated
•
249k
•
72
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation
•
6B
•
Updated
•
150k
•
83
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ
Text Generation
•
4B
•
Updated
•
3.29k
•
23
numind/NuExtract-2.0-8B-GPTQ
Image-Text-to-Text
•
3B
•
Updated
•
241
•
3
lmstudio-community/Magistral-Small-2506-MLX-4bit
Text Generation
•
4B
•
Updated
•
240k
•
14
SoybeanMilk/Kimi-VL-A3B-Thinking-2506-BNB-4bit
Image-Text-to-Text
•
9B
•
Updated
•
675
•
4
tencent/Hunyuan-A13B-Instruct-GPTQ-Int4
Text Generation
•
11B
•
Updated
•
67.2k
•
46
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
Image-Text-to-Text
•
8B
•
Updated
•
44.3k
•
7
Rainnighttram/GLM-4.1V-9B-Thinking-bnb-4bit
Image-to-Text
•
10B
•
Updated
•
1.51k
•
9
cpatonn/OpenCodeReasoning-Nemotron-1.1-32B-AWQ
Text Generation
•
6B
•
Updated
•
10
•
2
0xroyce/silent-voice-multimodal
8B
•
Updated
•
254
•
2
NLPnorth/snakmodel-7b-instruct-mlx-4bit
Text Generation
•
1B
•
Updated
•
14
•
2
TheBloke/WizardCoder-15B-1.0-GPTQ
Text Generation
•
3B
•
Updated
•
1.13k
•
178
TheBloke/WizardLM-33B-V1-0-Uncensored-SuperHOT-8K-GPTQ
Text Generation
•
4B
•
Updated
•
32
•
92
TheBloke/Wizard-Vicuna-13B-Uncensored-SuperHOT-8K-GPTQ
Text Generation
•
2B
•
Updated
•
1.05k
•
149
TheBloke/WizardLM-Uncensored-SuperCOT-StoryTelling-30B-SuperHOT-8K-GPTQ
Text Generation
•
4B
•
Updated
•
17
•
47
TheBloke/Nous-Hermes-Llama2-GPTQ
Text Generation
•
2B
•
Updated
•
1.01k
•
60
TheBloke/MythoMax-L2-13B-GPTQ
Text Generation
•
2B
•
Updated
•
6.83k
•
211
TheBloke/Nous-Capybara-7B-GPTQ
Text Generation
•
1B
•
Updated
•
16
•
4
TheBloke/Mistral-Pygmalion-7B-GPTQ
Text Generation
•
1B
•
Updated
•
22
•
10
TheBloke/Starling-LM-7B-alpha-GPTQ
Text Generation
•
1B
•
Updated
•
66
•
10
TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ
Text Generation
•
6B
•
Updated
•
20k
•
138