-
-
-
-
-
-
Inference Providers
Active filters:
sparsity
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
5
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
6
•
1
RedHatAI/Sparse-Llama-3.1-8B-2of4
Text Generation
•
8B
•
Updated
•
39
•
62
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
9
•
3
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4
Text Generation
•
8B
•
Updated
•
10
•
1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4
Text Generation
•
8B
•
Updated
•
5
•
1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
6
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
6
bartowski/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation
•
8B
•
Updated
•
1.3k
•
3
QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation
•
8B
•
Updated
•
335
•
4
tensorblock/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation
•
8B
•
Updated
•
275
nintwentydo/pixtral-12b-2409-2of4-sparse
Image-Text-to-Text
•
13B
•
Updated
•
1
HangGuo/Llama2-70B-QuaRot-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
26
•
1
HangGuo/Llama2-70B-QuaRot-OBR-RTN-W4A4KV4S50
Text Generation
•
Updated
•
3
HangGuo/Llama2-70B-SpinQuant-OBR-RTN-W4A4KV4S50
Text Generation
•
Updated
•
3
HangGuo/Llama2-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
7
HangGuo/Llama3-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
7
HangGuo/Llama3-70B-SpinQuant-OBR-RTN-W4A4KV4S50
Text Generation
•
Updated
•
4
HangGuo/Llama3-70B-QuaRot-OBR-RTN-W4A4KV16S50
Text Generation
•
Updated
•
6
HangGuo/Llama3-70B-QuaRot-OBR-GPTQ-W4A4KV16S50
Text Generation
•
Updated
•
5
HangGuo/QWen2.5-7B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
6
HangGuo/QWen2.5-32B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
7
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A8KV16S50
Text Generation
•
Updated
•
3
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A8KV16S50
Text Generation
•
Updated
•
5
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
16
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
9