Quantized models (4bit) request

by terminator33 - opened Apr 4

Discussion

terminator33

Apr 4

Can we please get a 4bit quantized model!!

MarsupialAI

Apr 4

LCPP does not support this model yet. I have an issue open with them on their github. I will be quantizing it as soon as it's resolved, and I have to assume others will as well.

smcleod

Apr 4

PR here: https://github.com/ggerganov/llama.cpp/pull/6491

sarahooker

Cohere For AI org Apr 5

We released a 4 bit quantized model here today: https://huggingface.co/CohereForAI/c4ai-command-r-plus-4bit. Enjoy!

sarahooker changed discussion status to closed Apr 5

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment