Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
PradeepMoturi commited on
Commit
d25619a
·
verified ·
1 Parent(s): e1c4f2a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -1,6 +1,11 @@
1
  ---
2
  license: mit
3
  library_name: transformers
 
 
 
 
 
4
  ---
5
  # 🧮 Fathom-R1-14B: $499 Training Recipe for Unlocking Math Reasoning at o4-mini level using R1-distilled-14B model under 16K context
6
 
 
1
  ---
2
  license: mit
3
  library_name: transformers
4
+ datasets:
5
+ - FractalAIResearch/Fathom-V0.4-SFT-Shortest-Chains
6
+ - FractalAIResearch/Fathom-V0.6-Iterative-Curriculum-Learning
7
+ base_model:
8
+ - deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
9
  ---
10
  # 🧮 Fathom-R1-14B: $499 Training Recipe for Unlocking Math Reasoning at o4-mini level using R1-distilled-14B model under 16K context
11