Model Card
- Base model:
Qwen/Qwen3-32B
- Quantization method: LNQ with GuidedQuant Hessian
- Target bit-width: 3
- Backend kernel: Any-Precision-LLM kernel (
ap-gemv
)
- Calibration data: RedPajama (1024 sentences / 4096 tokens)
- Calibration objective: Next-token prediction
- num_groups (for GuidedQuant Hessian): 1
How to run
References
Model tree for jusjinuk/Qwen3-32B-3bit-GuidedQuant-LNQ
Collection including
jusjinuk/Qwen3-32B-3bit-GuidedQuant-LNQ