-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
PrunaAI/TheDrummer-Smegmma-9B-v1-bnb-8bit-smashed
Updated
•
9
•
1
MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF
Text Generation
•
Updated
•
760k
•
39
RedHatAI/Meta-Llama-3.1-8B-quantized.w8a8
Text Generation
•
Updated
•
182
•
3
shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8
Updated
•
17
•
2
nm-testing/Mixtral-8x7B-Instruct-v0.1-W8A8-quantized
Updated
•
6
•
1
mlconvexai/jais-13b-chat_bitsandbytes_8bit
Text Generation
•
Updated
•
48
•
1
LoneStriker/Hermes-3-Llama-3.1-8B-8.0bpw-h8-exl2
Updated
•
5
•
1
johnsnowlabs/JSL-MedLlama-3-8B-v17-8bits
Updated
•
7
•
1
MaziyarPanahi/Phi-3.5-mini-instruct-GGUF
Text Generation
•
Updated
•
559k
•
11
Statuo/NemoMix-Unleashed-EXL2-8bpw
Text Generation
•
Updated
•
63
•
4
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
2.13k
•
29
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
827
•
14
watsonchua/hansard-gemma-2-9b-lora
Updated
•
2
•
1
illuin/llama-3-grouse
Text Generation
•
Updated
•
1
•
1
MaziyarPanahi/Yi-Coder-1.5B-Chat-GGUF
Text Generation
•
Updated
•
771k
•
7
MaziyarPanahi/Yi-Coder-9B-Chat-GGUF
Text Generation
•
Updated
•
543k
•
5
qeternity/Mistral-Large-Instruct-2407-w8a8
Updated
•
6
•
1
HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation
•
Updated
•
1.3k
•
174
MaziyarPanahi/reader-lm-0.5b-GGUF
Text Generation
•
Updated
•
129
•
3
MaziyarPanahi/solar-pro-preview-instruct-GGUF
Text Generation
•
Updated
•
762k
•
25
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
939
•
11
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
749
•
9
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
724
•
3
Qwen/Qwen2.5-3B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
1.53k
•
3
Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
18.1k
•
15
Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
50.1k
•
20
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
6.69k
•
23
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF
Text Generation
•
Updated
•
762k
•
5
MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF
Text Generation
•
Updated
•
790k
•
9
brunopio/Llama3-8B-1.58-100B-tokens-GGUF
Text Generation
•
Updated
•
916
•
15