EXL3 quantization of Mistral-Nemo-Instruct-2407, 6 bits per weight.
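
As a rough guide to what 6 bits per weight means for storage, the sketch below estimates the size of the weights alone. The parameter count used here (about 12 billion) is an assumption and is not stated in this card. The helper `weight_bytes` is a hypothetical name for illustration. Some tensors may be stored at a different precision, so the real files can differ from this figure.

```python
def weight_bytes(n_params: int, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone, in bytes."""
    return n_params * bits_per_weight / 8


# ASSUMPTION: nominal parameter count of about 12 billion (not stated in this card).
N_PARAMS = 12_000_000_000

quantized_gb = weight_bytes(N_PARAMS, 6) / 1e9   # 6 bits per weight
fp16_gb = weight_bytes(N_PARAMS, 16) / 1e9       # unquantized FP16 baseline

print(f"6 bpw: ~{quantized_gb:.1f} GB, FP16: ~{fp16_gb:.1f} GB")
```

This counts weights only. The KV cache and activations need additional memory at inference time.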
