Try Yi-6B with llama.cpp
You can now try this at:
https://huggingface.co/spaces/ztime/Yi-6B-GGUF_llama_cpp_python
Nice try! GGUF seems to add a BOS token in front of the prompt by default (which is not used by Yi base models). Does this app deal with that?
Indeed. I've updated the app and removed the BOS token.
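For anyone driving the model through llama-cpp-python themselves, one way to skip the BOS token is to tokenize the prompt explicitly with `add_bos=False` and pass the token list to the completion call. A minimal sketch (the model path and prompt are placeholders, not from this thread):

```python
from llama_cpp import Llama

# Placeholder path -- point this at your local Yi-6B GGUF file.
llm = Llama(model_path="./yi-6b.Q4_K_M.gguf", n_ctx=2048)

prompt = "There's a place where time stands still."

# Tokenize without the leading BOS token, then feed the raw token list
# instead of a string so no BOS is prepended automatically.
tokens = llm.tokenize(prompt.encode("utf-8"), add_bos=False)
output = llm.create_completion(prompt=tokens, max_tokens=64)

print(output["choices"][0]["text"])
```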
@ztime Can you share the Yi-6B GGUF file with us?
My T4 can't successfully run the Yi-34B model, even with 2-bit quantization.
You can find the 6B GGUF files here: https://huggingface.co/SamPurkis/Yi-6B-GGUF
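A quick way to fetch and load one of those files with llama-cpp-python; the filename is an assumption, so check the repo's file list for the exact quantization you want:

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Download one quant from the repo linked above.
# The filename below is a guess -- adjust it to a file that actually
# exists in SamPurkis/Yi-6B-GGUF.
model_path = hf_hub_download(
    repo_id="SamPurkis/Yi-6B-GGUF",
    filename="yi-6b.Q4_K_M.gguf",
)

llm = Llama(model_path=model_path, n_ctx=2048)
print(llm("Once upon a time", max_tokens=32)["choices"][0]["text"])
```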
A Mac with an M1 chip and 32 GB of RAM can run inference on the Yi-34B GGUF with llama.cpp.
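If you go through the llama-cpp-python wrapper rather than the llama.cpp CLI, a sketch for Apple Silicon looks like this (the model path is a placeholder, and this assumes a Metal-enabled build):

```python
from llama_cpp import Llama

# Placeholder path for a local Yi-34B GGUF quant; a 4-bit quant should
# fit in 32 GB of unified memory.
# n_gpu_layers=-1 offloads all layers to Metal on Apple Silicon builds.
llm = Llama(
    model_path="./yi-34b.Q4_K_M.gguf",
    n_ctx=2048,
    n_gpu_layers=-1,
)

print(llm("Q: Why is the sky blue? A:", max_tokens=64)["choices"][0]["text"])
```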