The 20B oss model is already quantized with MXFP4. How does 4-bit bnb quantization work on top of that?