EXL3 quantization of Reka Flash 3, 3 bits per weight.

Requires #20.

Downloads last month
4
Safetensors
Model size
4.55B params
Tensor type
FP16
·
I16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support