llama.cpp breaks quantized ggml file format

#11

by Waldschrat - opened May 16, 2023

May 16, 2023

llama.cpp decided to break the quantized ggml file format: https://github.com/ggerganov/llama.cpp/pull/1305

As nobody seems to be able (or willing) to provide a conversion script, the models need to be requantized (is that even a word?) from the source models.

As this is quite a hurdle for people new into the field (like me), so: May I ask you to please quantize and upload the models in the new format?

venketh

May 17, 2023

I'll take a swing at it - https://huggingface.co/venketh/GPT4-X-Alpaca-30B-4bit-ggml

MetaIX

Owner May 18, 2023

Updated the quants and added q5_0

concedo

May 26, 2023

Might have to be updated again heh

MetaIX

Owner May 26, 2023

Updating again today...

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment