IQ3_XXS use more VRAM

#5
by mahdisml - opened

Hi ! i tried IQ3_XXS but something wrong happened .

{7DB5A7C6-9192-4C6A-A697-3EEE85B018B8}.png

Size of IQ3_XXS is 4.78GB but when i use it in lmstudio it takes up 6.5 GB VRAM. I've tried a lot of models before, but I've never seen this issue.
Is this normal ?

before load :

{441F48C7-8CD8-418B-8C0B-6D4D85B4B457}.png

after load :
{07437FFF-B983-47E6-B8CC-E3DFC531FB03}.png


settings :

{A0D1316D-BBCB-486E-8078-0DDA8AA08644}.png

Yes, models will use more RAM after being loaded due to a variety of reasons, most notably context and overhead

This is about what you'd expect from a typical model, 2-4GB extra

Thank you so much! 🙏✨

Sign up or log in to comment