Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Qwen
/
Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4

Text Generation
Transformers
Safetensors
English
qwen2_moe
chat
conversational
4-bit precision
gptq
Model card Files Files and versions
xet
Community
3
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Int4为什么比没量化的float32和float16还慢

3
#3 opened 3 months ago by
hujianmin

Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4 模型加载时间过长(近 2 小时)

#2 opened 11 months ago by
TimVan1

Not working with sample code

3
#1 opened about 1 year ago by
rupeshs
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs