ISoloist1's picture
Update README.md
0dfc0b4 verified
metadata
license: apache-2.0
base_model:
  - google/madlad400-3b-mt
pipeline_tag: translation
tags:
  - text2text-generation
  - text-generation-inference

ONNX version of google/madlad400-3b-mt

Converted and quantized with optimum-cli

  • Convert to ONNX:
optimum-cli onnxruntime export --model google/madlad400-3b-mt <output_path> --legacy
  • Quantization:
optimum-cli onnxruntime quantize --onnx_model <input_model_path> -o <output_model_path> --avx512_vnni