YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

license: apache-2.0 license_link: https://huggingface.co/Qwen/Qwen3-30B-A3B/blob/main/LICENSE pipeline_tag: text-generation tags:

  • Qwen3
  • gptq
  • int4
  • 量化
  • vLLM base_model:
    • Qwen/Qwen3-30B-A3B base_model_relation: quantized

通义千问Qwen3-30B-A3B-GPTQ-Int8量化

基础模型 通义千问3-30B-A3B

最近更新

2025-05-08
fix (model.layers.*.mlp.gate) are not quantized

依赖

vllm==0.8.5

SDK下载

#安装ModelScope
pip install modelscope
#SDK模型下载
from modelscope import snapshot_download
model_dir = snapshot_download('JunHowie/Qwen3-30B-A3B-GPTQ-Int8')

Git下载

#Git模型下载
git clone https://www.modelscope.cn/JunHowie/Qwen3-30B-A3B-GPTQ-Int8.git

如果您是本模型的贡献者,我们邀请您根据模型贡献文档,及时完善模型卡片内容。

Downloads last month
1,399
Safetensors
Model size
8.42B params
Tensor type
I32
·
BF16
·
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support