Update README.md
Browse files
README.md
CHANGED
@@ -133,6 +133,7 @@ Math-Reassoning
|
|
133 |
## Training and evaluation data
|
134 |
|
135 |
Training data curated from [collinear-ai/R1-Distill-SFT-Curated](https://huggingface.co/datasets/collinear-ai/R1-Distill-SFT-Curated)
|
|
|
136 |
|
137 |
## Training procedure
|
138 |
|
@@ -161,6 +162,10 @@ The following hyperparameters were used during training:
|
|
161 |
| 0.3174 | 0.3335 | 1247 | 0.3329 |
|
162 |
| 0.307 | 0.6670 | 2494 | 0.3169 |
|
163 |
|
|
|
|
|
|
|
|
|
164 |
|
165 |
### Framework versions
|
166 |
|
|
|
133 |
## Training and evaluation data
|
134 |
|
135 |
Training data curated from [collinear-ai/R1-Distill-SFT-Curated](https://huggingface.co/datasets/collinear-ai/R1-Distill-SFT-Curated)
|
136 |
+
Evaluation data: [HuggingFaceH4/MATH-500](https://huggingface.co/datasets/HuggingFaceH4/MATH-500)
|
137 |
|
138 |
## Training procedure
|
139 |
|
|
|
162 |
| 0.3174 | 0.3335 | 1247 | 0.3329 |
|
163 |
| 0.307 | 0.6670 | 2494 | 0.3169 |
|
164 |
|
165 |
+
### Evaluation on Math500
|
166 |
+
|
167 |
+

|
168 |
+
|
169 |
|
170 |
### Framework versions
|
171 |
|