Update README.md
Browse files
README.md
CHANGED
@@ -1,6 +1,11 @@
|
|
1 |
---
|
2 |
license: mit
|
3 |
library_name: transformers
|
|
|
|
|
|
|
|
|
|
|
4 |
---
|
5 |
# 🧮 Fathom-R1-14B: $499 Training Recipe for Unlocking Math Reasoning at o4-mini level using R1-distilled-14B model under 16K context
|
6 |
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
library_name: transformers
|
4 |
+
datasets:
|
5 |
+
- FractalAIResearch/Fathom-V0.4-SFT-Shortest-Chains
|
6 |
+
- FractalAIResearch/Fathom-V0.6-Iterative-Curriculum-Learning
|
7 |
+
base_model:
|
8 |
+
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
|
9 |
---
|
10 |
# 🧮 Fathom-R1-14B: $499 Training Recipe for Unlocking Math Reasoning at o4-mini level using R1-distilled-14B model under 16K context
|
11 |
|