Quantise to support llama.cpp

#1
by TusharRay - opened

Can this be quantised to support https://github.com/ggerganov/llama.cpp? llama.cpp is really performant, and a quantised version would let this model be used widely across multiple platforms!
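For reference, assuming the architecture is supported by llama.cpp, the usual route would be something like the sketch below: convert the Hugging Face checkpoint to GGUF and then quantise it. Script and binary names (convert_hf_to_gguf.py, llama-quantize) vary between llama.cpp versions, and the paths are placeholders, so treat this as a rough outline rather than exact commands.

```python
# Minimal sketch of the typical llama.cpp quantisation workflow.
# Assumptions: MODEL_DIR and LLAMA_CPP are placeholder paths, and the
# convert_hf_to_gguf.py script / llama-quantize binary names match your
# llama.cpp version (older releases use different names).
import subprocess

MODEL_DIR = "path/to/local-hf-checkpoint"   # placeholder: downloaded model
LLAMA_CPP = "path/to/llama.cpp"             # placeholder: cloned/built repo

# Step 1: convert the HF weights to a full-precision GGUF file.
subprocess.run(
    ["python", f"{LLAMA_CPP}/convert_hf_to_gguf.py", MODEL_DIR,
     "--outfile", "model-f16.gguf", "--outtype", "f16"],
    check=True,
)

# Step 2: quantise the GGUF file, e.g. to 4-bit (Q4_K_M).
subprocess.run(
    [f"{LLAMA_CPP}/llama-quantize", "model-f16.gguf",
     "model-Q4_K_M.gguf", "Q4_K_M"],
    check=True,
)
```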
