This model is an int4 model with group_size 128 and symmetric quantization of Qwen/Qwen2-0.5B-Instruct generated by intel/auto-round algorithm.

鈿狅笍 Important: This model is used for internal testing with VLLM. Please do not delete or modify without approval.

Downloads last month
4,731
Safetensors
Model size
184M params
Tensor type
I32
BF16
F16
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support