Arcana Qwen3-2.4B-A0.6B
Collection
Qwen3 MoE model
•
5 items
•
Updated
•
1
This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its mathematical problem-solving and reasoning capabilities. Training was conducted exclusively on the OpenMathReasoning-mini
dataset, and the model was optimized using the bfloat16 (bf16) data type.
Dataset Preparation
unsloth/OpenMathReasoning-mini
dataset was used.Model Loading and Configuration
unsloth
library in bf16 precision.full_finetuning=True
) to adapt the model for mathematical reasoning.Supervised Fine-Tuning
This project is licensed under the Apache License 2.0. See the LICENSE file for details.
Base model
Qwen/Qwen3-0.6B-Base