prudant's picture
Upload compressed model
0b4f016 verified
raw
history blame contribute delete
172 Bytes
default_stage:
default_modifiers:
SmoothQuantModifier: {smoothing_strength: 0.8}
GPTQModifier:
targets: [Linear]
ignore: [lm_head]
scheme: W8A8