Request to add more GGUF quantization size options

by makisekurisu-jp - opened Apr 30

Apr 30

I’d like to request the addition of more GGUF quantization size options to better support different hardware setups. Thank you!

makisekurisu-jp

May 1

@ND911 Q5_0, Q5_1, Q4_0, Q4_1 PLEASE

ND911

Owner May 4

Sorry for taking so long, but they're uploaded now.

ND911 changed discussion status to closed May 4

AKDesign

May 7, edited May 7

Can you PLEASE add a smaller Q2 version?
At least the Q2 versions are at 6 GB, which is similar to SDXL models and can be run on most potato laptops!
Also, can you make an imatrix version for an even smaller size of 2-4 GB?

PS: The Q4 version at 11 GB is still a memory hog :(
We NEED smaller models!

ND911

Owner May 8

Q2 is up, and AFAIK imatrix doesn't work in ComfyUI.

AKDesign

May 8

That was fast, looking forward to testing it out, Thanks!
