https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503

Q6_K_XL: Q6_K weights, with the output and embedding tensors left untouched (not quantized)

Fits a 24K-token context on a 24 GiB GPU
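
As a rough sanity check on that claim, the following is a minimal back-of-the-envelope sketch, not a measurement. The 23.6B parameter count comes from this card and the Q6_K block size (210 bytes per 256 weights) is the standard llama.cpp layout. The layer count, KV head count, head dimension, and the f16 KV cache are assumptions about the base model and runtime settings, not stated here; the function names are illustrative.

```python
# Back-of-the-envelope VRAM estimate. Architecture numbers are ASSUMPTIONS.
GIB = 1024 ** 3


def q6k_weight_bytes(n_params: float) -> float:
    """Size of n_params weights stored as Q6_K.

    Q6_K packs 256 weights into a 210-byte block (6.5625 bits per weight).
    """
    return n_params * 210 / 256


def kv_cache_bytes(n_ctx: int, n_layers: int, n_kv_heads: int,
                   head_dim: int, bytes_per_elem: int = 2) -> int:
    """Size of the KV cache: one K and one V vector per layer, KV head, and token."""
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem


N_PARAMS = 23.6e9      # from the model card
N_CTX = 24 * 1024      # 24K tokens
N_LAYERS = 40          # assumption: not stated in this card
N_KV_HEADS = 8         # assumption: not stated in this card
HEAD_DIM = 128         # assumption: not stated in this card

weights_gib = q6k_weight_bytes(N_PARAMS) / GIB
kv_gib = kv_cache_bytes(N_CTX, N_LAYERS, N_KV_HEADS, HEAD_DIM) / GIB

print(f"weights (all Q6_K): {weights_gib:.2f} GiB")
print(f"KV cache (f16):     {kv_gib:.2f} GiB")
print(f"subtotal:           {weights_gib + kv_gib:.2f} GiB")
```

The subtotal is a lower bound: it treats every tensor as Q6_K, whereas the untouched output and embedding tensors are larger, and it ignores compute buffers and other runtime overhead.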

llama.cpp does not yet support exporting the vision components, so they are not included

Format: GGUF
Model size: 23.6B params
Architecture: llama