-
-
-
-
-
-
Inference Providers
Active filters:
vllm
huihui-ai/Devstral-Small-2505-abliterated
Text2Text Generation
•
Updated
•
50
•
4
Mungert/Devstral-Small-2505-GGUF
Text2Text Generation
•
Updated
•
649
•
1
RedHatAI/gemma-3-27b-it-quantized.w8a8
Image-Text-to-Text
•
Updated
•
1
Inferless/deciLM-7B-GPTQ
Text Generation
•
Updated
•
38
•
1
Inferless/SOLAR-10.7B-Instruct-v1.0-GPTQ
Text Generation
•
Updated
•
26
•
2
Inferless/Mixtral-8x7B-v0.1-int8-GPTQ
Text Generation
•
Updated
•
31
•
2
RedHatAI/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
Updated
•
2.93k
•
23
RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8
Text Generation
•
Updated
•
99
•
3
RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV
Text Generation
•
Updated
•
4.95k
•
8
RedHatAI/Meta-Llama-3-70B-Instruct-FP8
Text Generation
•
Updated
•
8.83k
•
12
RedHatAI/Qwen2-72B-Instruct-FP8
Text Generation
•
Updated
•
1.51k
•
15
RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8
Text Generation
•
Updated
•
17
•
3
RedHatAI/Qwen2-0.5B-Instruct-FP8
Text Generation
•
Updated
•
1.16k
•
3
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
•
Updated
•
4.71k
RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
•
Updated
•
15.4k
•
2
nm-testing/SparseLlama-3-8B-pruned_50.2of4-FP8
Text Generation
•
Updated
•
18
FlorianJc/Hermes-2-Pro-Mistral-7B-vllm-fp8
Text Generation
•
Updated
•
47
FlorianJc/openchat-3.6-8b-20240522-vllm-fp8
Text Generation
•
Updated
•
18
FlorianJc/Llama3-ChatQA-1.5-8B-vllm-fp8
Text Generation
•
Updated
•
25
RedHatAI/Meta-Llama-3-70B-Instruct-FP8-KV
Text Generation
•
Updated
•
70
•
2
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
•
Updated
•
2.09k
•
2
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
•
Updated
•
689
RedHatAI/Phi-3-mini-128k-instruct-FP8
Text Generation
•
Updated
•
28
RedHatAI/Phi-3-medium-128k-instruct-FP8
Text Generation
•
Updated
•
36
•
5
FlorianJc/google-gemma-2-9b-it-vllm-fp8
Text Generation
•
Updated
•
19
•
1
tranhoangnguyen03/Gemma-2-9B-It-SPPO-Iter3_Q8
Text Generation
•
Updated
•
12
FlorianJc/Llama3-ChatQA-1.5-8B-v2-vllm-fp8
Text Generation
•
Updated
•
15
FlorianJc/MegaBeam-Mistral-7B-300k-vllm-fp8
Text Generation
•
Updated
•
15
RedHatAI/gemma-2-9b-it-FP8
Text Generation
•
Updated
•
786
•
5
nm-testing/Llama-2-70b-chat-hf-FP8
Text Generation
•
Updated
•
14