Make 8 bit quantized model version

#14

by nonetrix - opened Jan 12, 2024

Jan 12, 2024

•

It would be nice if there was a 8 bit gguff version as well for less compressed model

Feb 24, 2024

This comment has been hidden

Feb 24, 2024

This comment has been hidden

Apr 5, 2024

It would be nice if there was a 8 bit gguff version as well for less compressed model

where you able to run it with llama.cpp?
can you share some code example?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment