Qwen3-30B-A3B-Fp8-v1 / recipe.yaml
dsikka's picture
Upload folder using huggingface_hub
89d678b verified
default_stage:
default_modifiers:
QuantizationModifier:
targets: [Linear]
ignore: [lm_head, 're:.*mlp.gate$']
scheme: FP8