Uploaded finetuned model
- Developed by: ertghiu256
- License: apache-2.0
- Finetuned from model : unsloth/qwen3-4b-unsloth-bnb-4bit
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Model information
This is Qwen 3 4b parameters finetuned on 18k samples from sequelbox/Celestia3-DeepSeek-R1-0528 dataset that is distilled from Deepseek R1 0528.
Model purposes
- General reasoning
- Code (note: this model is not trained on html code, so the html code generated might look horible)
- Solving problems
Note: This model development is not from the deepseek team.
- Downloads last month
- 75
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for ertghiu256/deepseek-r1-0528-distilled-qwen3
Base model
Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B