Beinsezii
/

Mistral-Small-3.1-24B-Instruct-2503-Q6-K-U-GGUF

Q6_K_XL: Q6_K weights, untouched outputs, untouched embed

Fits 24K CTX on a 24GiB GPU

llama.cpp does not support exporting the vision components yet

GGUF

Model size

23.6B params

Architecture

llama

Hardware compatibility

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support