--- license: apache-2.0 language: - ja - en base_model: - cyberagent/Mistral-Nemo-Japanese-Instruct-2408 tags: - gptq --- W8A8-INT8 GPTQ + SmoothQuant quant of [cyberagent/Mistral-Nemo-Japanese-Instruct-2408](https://huggingface.co/cyberagent/Mistral-Nemo-Japanese-Instruct-2408) w/ [LLM Compressor](https://github.com/vllm-project/llm-compressor) 0.4.0 using [augmxnt/ultra-orca-boros-en-ja-v1](https://huggingface.co/datasets/augmxnt/ultra-orca-boros-en-ja-v1) as calibration set