Problem with KV-cache

#4
by SolidSnacke - opened

Is it normal that cache_4bit or cache_8bit does not work with this model? More precisely, enabling either option throws an error when loading the model. I'm using oobabooga.

I can't say whether it's supposed to be like that, but I had to do the following to get it to run: in koboldcpp, turn F16 off for the KV cache, and in llama.cpp, pass -nkvo.
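For reference, a minimal llama.cpp invocation with KV-cache offloading disabled might look like this (the model filename and prompt are placeholders, not from this thread):

```shell
# -ngl 99 offloads the model layers to the GPU; -nkvo (--no-kv-offload)
# keeps the KV cache in system RAM instead, which avoids the loading
# error described above. Model path below is a placeholder.
./llama-cli -m ./model-Q5_K_M.gguf -ngl 99 -nkvo -p "Hello"
```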
