SN-COL Math Curators
Collection
3 items
โข
Updated
This is an open-source fine-tuned reasoning adapter of microsoft/Phi-3.5-mini-instruct, transformed into a math reasoning model using data curated from collinear-ai/R1-Distill-SFT-Curated.
axolotl version: 0.5.0
Math-Reassoning
Training data curated from collinear-ai/R1-Distill-SFT-Curated Evaluation data: HuggingFaceH4/MATH-500
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
No log | 0.0003 | 1 | 0.6646 |
0.3174 | 0.3335 | 1247 | 0.3329 |
0.307 | 0.6670 | 2494 | 0.3169 |
Base model
microsoft/Phi-3.5-mini-instruct