I'm curious how did you convert it to GGUF ?
#3
by
prayanksai
- opened
The org version is multi-modal ; looks like LLamacpp needs an update to work with it.
Suggest you submit a ticket at Llamacpp/Github asap.
RE: Quants.
Used a "bootleg" version of the source files with "vision" components removed.
Someone converted the VLLM to safetensors with config files and I used that to create the GGUFs.
Source: (this is one version)
https://huggingface.co/mrfakename/mistral-small-3.1-24b-instruct-2503-hf