quant the qat...

#1
by mmeyerlein - opened

just for my understanding.
i.e. you changed the model to a new data format, but kept the weights as they were already 4bit, so there are no performance changes, right?

Sign up or log in to comment