Uploaded finetuned model

  • Developed by: ertghiu256
  • License: apache-2.0
  • Finetuned from model : unsloth/qwen3-4b-unsloth-bnb-4bit

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Model information

This is Qwen 3 4b parameters finetuned on 18k samples from sequelbox/Celestia3-DeepSeek-R1-0528 dataset that is distilled from Deepseek R1 0528.

Model purposes

  • General reasoning
  • Code (note: this model is not trained on html code, so the html code generated might look horible)
  • Solving problems

Note: This model development is not from the deepseek team.

Downloads last month
75
Safetensors
Model size
4.02B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ertghiu256/deepseek-r1-0528-distilled-qwen3

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Finetuned
(154)
this model
Merges
1 model
Quantizations
1 model

Datasets used to train ertghiu256/deepseek-r1-0528-distilled-qwen3