Prompt format:
ChatML

Note:
Set the additional settings as per the instructions in the image at the end of the card to use the thinking setup. [1]

Reasoning setup in SillyTavern:

Downloads last month: 2,097

GGUF

Model size

8.19B params

Architecture

qwen3

Hardware compatibility

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Lewdiculous/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-GGUF-IQ-Imatrix

Base model

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Finetuned

ArliAI/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small

Quantized

(18)

this model

Collection including Lewdiculous/DS-R1-Qwen3-8B-ArliAI-RpR-v4-Small-GGUF-IQ-Imatrix

Quantized Models (GGUF, IQ, Imatrix)

Collection

Various GGUF quantizations of small models. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 98 items • Updated about 15 hours ago • 63