FrenzyBiscuit
/

Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-AWQ

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions

FrenzyBiscuit

AWQ Details

Model was quantized down to INT4 using GEMM Kernels.
Zero point quantization
Group size of 64

Downloads last month: 15

Safetensors

Model size

420M params

Tensor type

I32

·

BF16

·

FP16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for FrenzyBiscuit/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-AWQ

Base model

Qwen/Qwen2.5-1.5B

Finetuned

Qwen/Qwen2.5-1.5B-Instruct

Finetuned

BeaverAI/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B

Quantized

(9)

this model