This model is an int4 model with group_size 128 and symmetric quantization of Qwen/Qwen2-0.5B-Instruct generated by intel/auto-round algorithm.

⚠️ Important: This model is used for internal testing with VLLM. Please do not delete or modify without approval.

Safetensors

Model size

184M params

Tensor type

I32

BF16

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support