quant the qat...
#1, opened by mmeyerlein
Just for my understanding:
i.e., you converted the model to a new data format but kept the weights as they were, since they were already 4-bit, so there are no performance changes, right?