Re-quantize and re-upload model

by mtasic85 - opened 9 days ago

Discussion

mtasic85

9 days ago

@Lyte llama.cpp fixed issue with BOS/EOS tokens, and all models need to be re-quantized and re-uploaded.

Please check https://github.com/ggerganov/llama.cpp/issues/9315

Lyte

Owner 9 days ago

okay thanks for letting me know, I'll get to it asap!

Lyte

Owner 6 days ago

•

edited 6 days ago

@mtasic85 done using latest llama.cpp same as the other repo. feel free to try quantizing them yourself I've included the notebook i use to do this, it's the same one that i used for both RWKV models.

Lyte changed discussion status to closed 6 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment