Request: Int4 GPTQ AutoRound
#1
by jart25 - opened
Hi, would it be possible to publish the model in this format, please? For example: kaitchup/Qwen3-30B-A3B-autoround-4bit-gptq.
It would be really helpful. Thanks a lot!
@jart25, if you look at https://huggingface.co/Intel/Qwen3-30B-A3B-Instruct-2507-int4-AutoRound/blob/main/quantization_config.json or https://huggingface.co/Intel/Qwen3-30B-A3B-Instruct-2507-int4-mixed-AutoRound/blob/main/quantization_config.json, you can see that they are both AutoGPTQ-format models.
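For reference, a minimal sketch of how such an export can be produced with Intel's auto-round library. The `format="auto_gptq"` export is what makes the checkpoint load as a GPTQ model; exact constructor arguments may differ across auto-round versions, and the output directory name here is just an example:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound

model_name = "Qwen/Qwen3-30B-A3B-Instruct-2507"
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(model_name)

# 4-bit weights, group size 128, symmetric quantization
autoround = AutoRound(model, tokenizer, bits=4, group_size=128, sym=True)
autoround.quantize()

# Export in GPTQ format so the checkpoint loads with GPTQ kernels
autoround.save_quantized("Qwen3-30B-A3B-Instruct-2507-int4-gptq", format="auto_gptq")
```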
Do it using RunPod.
jart25 changed discussion status to closed
jart25 changed discussion status to open
I’ve uploaded the GPTQ quant version here: https://huggingface.co/jart25/Qwen3-30B-A3B-Instruct-2507-Autoround-Int-4bit-gptq
The AutoRound quant format is not supported on ROCm in vLLM, so the GPTQ export is the one to use there.
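For anyone loading it, a minimal vLLM sketch (this assumes a vLLM build with GPTQ support on your backend; the repo id is the one linked above):

```python
from vllm import LLM, SamplingParams

# GPTQ quantization is picked up automatically from the model's config
llm = LLM(model="jart25/Qwen3-30B-A3B-Instruct-2507-Autoround-Int-4bit-gptq")

out = llm.generate(["Hello!"], SamplingParams(max_tokens=32))
print(out[0].outputs[0].text)
```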