How does 4bit bnb quantization work on MXFP4?

#3
by vijay120 - opened

The 20B oss model is already quantized with MXFP4. How does 4-bit bnb (bitsandbytes) quantization work on top of that?
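For context, here is a minimal sketch of what "already quantized with MXFP4" means at the storage level. It assumes the OCP Microscaling layout: blocks of 32 four-bit E2M1 elements that share one 8-bit power-of-two (E8M0) scale. The function names are hypothetical, and the sketch does not show how this particular bnb checkpoint was produced.

```python
# Illustrative only: decode one MXFP4 block (OCP Microscaling layout assumed).
# All names below are hypothetical; they do not come from any library.

# Magnitudes representable by a 4-bit E2M1 element (low 3 bits of the code).
E2M1_MAGNITUDES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]


def decode_e2m1(code: int) -> float:
    """Map a 4-bit code (sign bit + 3 magnitude bits) to its float value."""
    sign = -1.0 if code & 0b1000 else 1.0
    return sign * E2M1_MAGNITUDES[code & 0b0111]


def dequantize_mxfp4_block(scale_e8m0: int, codes: list[int]) -> list[float]:
    """Dequantize a block of 32 E2M1 codes that share one E8M0 scale."""
    if len(codes) != 32:
        raise ValueError("an MXFP4 block holds exactly 32 elements")
    # E8M0 stores only a biased exponent: scale = 2 ** (stored - 127).
    scale = 2.0 ** (scale_e8m0 - 127)
    return [decode_e2m1(c) * scale for c in codes]


# Example: shared scale of 2**1, three non-zero elements, the rest zero.
block = dequantize_mxfp4_block(128, [0b0010, 0b1111, 0b0101] + [0] * 29)
print(block[:3])
```

A 4-bit bnb format such as NF4 uses a different 16-value codebook and per-block absmax scaling, so presumably the MXFP4 weights have to be dequantized to a higher precision like this before bnb can re-quantize them. Whether that is what was done here is the part I am asking about.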
