Qwen3-0.6B-Base-FP8-Dynamic / generation_config.json
CarlOwOs's picture
Add FP8 dynamically quantized Qwen3-0.6B-Base model using llm-compressor
9dce9b8 verified
raw
history blame contribute delete
117 Bytes
{
"bos_token_id": 151643,
"eos_token_id": 151643,
"max_new_tokens": 2048,
"transformers_version": "4.52.3"
}