CHE-72-ZLab/Alibaba-Qwen2-7B-Instruct-GGUF

This model was converted to GGUF format from Qwen/Qwen2-7B-Instruct using llama.cpp, via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Downloads last month: 34
Format: GGUF
Model size: 7.62B params
Architecture: qwen2

Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
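To get a rough sense of what these bit widths mean for file size, the sketch below multiplies the 7.62B parameter count from this card by each nominal width. It is only a lower-bound estimate under a simplifying assumption: every weight is stored at exactly the nominal width. Real GGUF quantization types typically carry extra per-block scale data and keep some tensors at higher precision, so actual files tend to be somewhat larger. The function names `estimate_size_gb` and `size_table` are illustrative and not part of llama.cpp or any library.

```python
# Rough lower-bound size estimate for a quantized model.
# Assumption: every weight is stored at exactly the nominal bit width,
# with no per-block scales or metadata (real GGUF files are larger).

PARAMS = 7.62e9                    # parameter count stated on the model card
NOMINAL_BITS = [2, 3, 4, 5, 6, 8]  # bit widths listed on the model card


def estimate_size_gb(params: float, bits_per_weight: float) -> float:
    """Return the estimated weight storage in gigabytes (10^9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


def size_table(params: float = PARAMS, bits=NOMINAL_BITS) -> dict:
    """Map each nominal bit width to its estimated size in gigabytes."""
    return {b: estimate_size_gb(params, b) for b in bits}


if __name__ == "__main__":
    for bits, gb in size_table().items():
        print(f"{bits}-bit: at least ~{gb:.2f} GB")
```

Under this assumption the 4-bit variant needs at least about 3.81 GB for the weights alone. Treat the figures as a floor when deciding which file might fit in memory, and leave headroom for the context cache and runtime overhead.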

Model tree for CHE-72-ZLab/Alibaba-Qwen2-7B-Instruct-GGUF

Base model: Qwen/Qwen2-7B
Quantized (74): this model