No IQ2_XXS version?
#19
by xldistance - opened
Please make an IQ2_XXS version for 235B
Maybe with UD, please
I'm also waiting for a UD IQ2_XXS version of 235B-A22B
Hey guys, thanks for all the hype! We'll need to work with llama.cpp to sort out the issues with quantizing the larger Qwen models
Sadly I don't think low-bit I-quants are possible - I tried debugging repeatedly, but the imatrix has NaNs and zeros - I'm going to have to work with the llama.cpp, Hugging Face, and Qwen folks to get to the bottom of this!
For now, the quants we provide are the only "stable" ones! Apologies!
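For anyone curious what that failure mode looks like, here is a minimal sketch of the kind of sanity check involved - not llama.cpp's or Unsloth's actual tooling, just an illustration assuming the per-tensor importance values have already been extracted into NumPy arrays (the tensor names and values below are synthetic):

```python
# Minimal sketch: scan per-tensor importance data for NaNs and zero entries,
# the symptoms described above. The dict of tensor name -> values is a
# stand-in for a real imatrix dump; nothing here reflects the actual file format.
import numpy as np

def check_imatrix(imatrix: dict[str, np.ndarray]) -> None:
    """Report tensors whose importance data would break low-bit I-quants."""
    for name, values in imatrix.items():
        n_nan = int(np.isnan(values).sum())
        n_zero = int((values == 0.0).sum())
        if n_nan or n_zero:
            print(f"{name}: {n_nan} NaNs, {n_zero}/{values.size} zeros")

# Synthetic example data standing in for a real imatrix:
fake = {
    "blk.0.ffn_down.weight": np.array([0.3, np.nan, 0.7]),
    "blk.1.attn_q.weight": np.zeros(4),
}
check_imatrix(fake)
```

Tensors that show up in a check like this are the ones where the quantizer has no usable importance signal, which is why the low-bit I-quants can't be produced until the upstream issue is resolved.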