smol

Built with llama.cpp using this version

There is an official GGUF quant, but it was missing the Q2_K quant.

GGUF

Model size

494M params

Architecture

qwen2

2-bit

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for cattoroboto/Qwen2-0.5B-Instruct-GGUF-Q2_K

Base model

Qwen/Qwen2-0.5B

Finetuned

Quantized

(55)

this model