EXL3 quantization of Qwen3-4B, 6 bits per weight.

HumanEval (argmax)

Model Q4 Q6 Q8 FP16
Qwen3-4B-exl3-4bpw 80.5% 81.1% 81.7% 80.5%
Qwen3-4B-exl3-6bpw 80.5% 85.4% 86.0% 86.0%
Qwen3-4B-exl3-8bpw-h8 82.3% 84.8% 83.5% 82.9%
Downloads last month
0
Safetensors
Model size
1.9B params
Tensor type
FP16
·
I16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for isogen/Qwen3-4B-exl3-6bpw

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Quantized
(73)
this model