UQFF
Collection
UQFF models. Examples for each in the model card!
β’
31 items
β’
Updated
β’
17
Qwen/Qwen3-4B
, UQFF quantization
Run with mistral.rs. Documentation: UQFF docs.
Quantization type(s) | Example |
---|---|
AFQ2 | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-afq2-0.uqff |
AFQ3 | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-afq3-0.uqff |
AFQ4 | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-afq4-0.uqff |
AFQ6 | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-afq6-0.uqff |
AFQ8 | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-afq8-0.uqff |
F8E4M3 | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-f8e4m3-0.uqff |
Q2K | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-q2k-0.uqff |
Q3K | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-q3k-0.uqff |
Q4K | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-q4k-0.uqff |
Q5K | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-q5k-0.uqff |
Q8_0 | ./mistralrs-server -i plain -m EricB/Qwen3-4B-UQFF -f qwen34b-q8_0-0.uqff |