Exl2 quants for Qwen1.5-0.5B-Chat
Automatically quantized using the auto quant script from hf-scripts
Created as an example for Auto EXL2 HF upload
BPW:
Inference API (serverless) does not yet support ExLlamaV2 models for this pipeline type.
Created as an example for Auto EXL2 HF upload