smol

GGUF Quants of Qwen/Qwen2-0.5B-Instruct

Built with llama.cpp using this version

There is an official GGUF quant, but it was missing the Q2_K quant.

Downloads last month
6
GGUF
Model size
494M params
Architecture
qwen2

2-bit

Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Model tree for cattoroboto/Qwen2-0.5B-Instruct-GGUF-Q2_K

Base model

Qwen/Qwen2-0.5B
Quantized
(51)
this model