Text Generation
Transformers
PyTorch
Safetensors
llama
text-generation-inference

Would be nice to provide quantized version like those by https://huggingface.co/TheBloke

#17
by tigerinus - opened

Preferrably GPTQ. Thanks.

Hi @tigerinus ,

Doesn't the quantized models provided by TheBloke work for you?

Hi @tigerinus ,

Doesn't the quantized models provided by TheBloke work for you?

Wait... R u saying TheBloke provides the quantized version of Yi models?

tigerinus changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment