Great job

#1 opened by ykarout

Thanks for this! It was in my pipeline and you saved me :)
Quick question: I used the gguf-my-repo space hosted on HF to convert a test Q8_0 quant, but it did not actually inject the chat template into the GGUF file. When I tested it in LM Studio, which usually extracts the chat template automatically, the template was not found, and I had to add it manually from the original Jinja file.
How did you manage to inject the chat template into all the quants? I know recent versions of llama.cpp handle this automatically during conversion, but I wonder why it was missing from the quant I created with the gguf-my-repo space: https://huggingface.co/spaces/ggml-org/gguf-my-repo
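
In case anyone wants to verify their own quants: here is a minimal check, assuming the gguf-py package that ships with llama.cpp (pip install gguf); the filename is just a placeholder.

```python
# Minimal sketch: read the GGUF metadata and print the embedded chat
# template if one exists. Assumes the gguf-py package from llama.cpp.
from gguf import GGUFReader

reader = GGUFReader("model-Q8_0.gguf")  # placeholder filename
field = reader.fields.get("tokenizer.chat_template")
if field is None:
    print("no chat template embedded in this GGUF")
else:
    # String metadata is stored as raw bytes; decode the value part.
    print(bytes(field.parts[field.data[0]]).decode("utf-8"))
```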

I have no idea. We don't have any special magic to add a chat template; we also just use llama.cpp, i.e. convert_hf_to_gguf.py.
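
For context, the template ends up in the file as plain GGUF metadata under the tokenizer.chat_template key; I believe convert_hf_to_gguf.py writes it via gguf-py's SpecialVocab handling. A toy sketch of that write path, assuming gguf-py's writer API (the arch, tensor, and template below are placeholders, not what the converter actually writes):

```python
# Toy sketch: embed a chat template as GGUF metadata with gguf-py.
import numpy as np
from gguf import GGUFWriter

writer = GGUFWriter("toy.gguf", arch="llama")  # placeholder output and arch
writer.add_chat_template("{% for m in messages %}{{ m.content }}{% endfor %}")
writer.add_tensor("dummy.weight", np.zeros((2, 2), dtype=np.float32))
writer.write_header_to_file()
writer.write_kv_data_to_file()
writer.write_tensors_to_file()
writer.close()
```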

Oh, seems like a bug in the gguf-my-repo space then… maybe an outdated llama.cpp version.
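
Until the space is fixed, re-converting with a current llama.cpp (or patching the quant with its gguf_new_metadata.py script, if I remember the name right) should work. For the manual route, here is a minimal sketch for recovering the template from the original repo, assuming it keeps the template in tokenizer_config.json (some newer repos ship a standalone .jinja file instead):

```python
# Minimal sketch: pull the chat template out of tokenizer_config.json.
import json

with open("tokenizer_config.json", encoding="utf-8") as f:
    template = json.load(f).get("chat_template")

if template is None:
    print("no chat_template key here; look for a standalone .jinja file")
else:
    print(template)  # paste into LM Studio or re-embed at conversion time
```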
