ertghiu256
/

deepseek-r1-0528-distilled-qwen3

Text Generation

text-generation-inference

Model card Files Files and versions Community

Uploaded finetuned model

Developed by: ertghiu256
License: apache-2.0
Finetuned from model : unsloth/qwen3-4b-unsloth-bnb-4bit

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Model information

This is Qwen 3 4b parameters finetuned on 18k samples from sequelbox/Celestia3-DeepSeek-R1-0528 dataset that is distilled from Deepseek R1 0528.

Model purposes

General reasoning
Code (note: this model is not trained on html code, so the html code generated might look horible)
Solving problems

Note: This model development is not from the deepseek team.

Downloads last month: 75

Safetensors

Model size

4.02B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ertghiu256/deepseek-r1-0528-distilled-qwen3

Base model

Qwen/Qwen3-4B-Base

Finetuned

Finetuned

(154)

this model

Merges

1 model

Quantizations

1 model

Datasets used to train ertghiu256/deepseek-r1-0528-distilled-qwen3