-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF
Text Generation
•
Updated
•
695k
•
5
MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF
Text Generation
•
Updated
•
722k
•
9
brunopio/Llama3-8B-1.58-100B-tokens-GGUF
Text Generation
•
Updated
•
873
•
15
Qwen/Qwen2.5-Coder-1.5B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
66
•
2
Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
21.6k
•
4
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation
•
Updated
•
371k
•
11
MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF
Text Generation
•
Updated
•
726k
•
9
qeternity/Qwen2.5-72B-Instruct-W8A8
Updated
•
14
•
3
noneUsername/Mistral-Small-Instruct-2409-W8A8-Dynamic-Per-Token
Updated
•
4
•
1
malenia1/ternary-weight-embedding
Updated
•
31
•
7
abdulmannan-01/qwen-2.5-7b-finetuned-for-sql-generation-bnb-8bit
Text Generation
•
Updated
•
37
•
1
MaziyarPanahi/Qwen2.5-7B-Instruct-abliterated-v2-GGUF
Text Generation
•
Updated
•
98
•
3
vector-institute/Llama3.2-Multimodal-Newsmedia-Bias-Detector
Updated
•
24
•
1
MaziyarPanahi/llm_3_2_flux_prompt-GGUF
Text Generation
•
Updated
•
190
•
2
akhmat-s/t5-large-quant-grammar-corrector
Text2Text Generation
•
Updated
•
8
•
1
MaziyarPanahi/Llama-3.2-1B-GGUF
Text Generation
•
Updated
•
107
•
1
radlab/pLLama3.1-8B-content
MaziyarPanahi/SmolLM2-135M-Instruct-GGUF
Text Generation
•
Updated
•
60
•
2
AIFunOver/stable-diffusion-3.5-large-turbo-openvino-8bit
Text-to-Image
•
Updated
•
27
•
1
Qwen/Qwen2.5-Coder-0.5B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
454
•
1
Qwen/Qwen2.5-Coder-3B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
183
•
1
Qwen/Qwen2.5-Coder-14B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
1.68k
•
5
Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
5.44k
•
20
mlx-community/Qwen2.5-Coder-32B-Instruct-8bit
Text Generation
•
Updated
•
145
•
9
mlx-community/Qwen2.5-Coder-32B-8bit
Text Generation
•
Updated
•
25
•
4
lmstudio-community/Qwen2.5-32B-Instruct-MLX-8bit
Text Generation
•
Updated
•
82
•
1
lmstudio-community/Qwen2.5-Coder-32B-Instruct-MLX-8bit
Text Generation
•
Updated
•
226
•
3
prithivMLmods/Qwen2.5-Coder-1.5B-Instruct-GGUF
Text Generation
•
Updated
•
58
•
3
prithivMLmods/Llama-3.2-3B-GGUF
Text Generation
•
Updated
•
86
•
2
AIFunOver/FLUX.1-dev-openvino-8bit
Text-to-Image
•
Updated
•
1