SN-COL Math Curators
Collection
3 items
โข
Updated
axolotl version: 0.5.0
This is an open-source fine-tuned reasoning adapter of microsoft/Phi-3.5-mini-instruct, transformed into a math reasoning model using data curated from collinear-ai/R1-Distill-SFT-Curated. It achieves the following results on the evaluation set:
This model is a LoRA adaptor and for best results merge it with base model microsoft/Phi-3.5-mini-instruct before use.
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
No log | 0.0003 | 1 | 0.6714 |
0.337 | 0.3335 | 1243 | 0.3361 |
0.3248 | 0.6669 | 2486 | 0.3203 |
The following figure shows the accuracy and the speedup of Collinear Curators C1 and C2 when compared to training on unfiltered dataset.
Base model
microsoft/Phi-3.5-mini-instruct