rabiulawal commited on
Commit
32feb5f
·
verified ·
1 Parent(s): 9738275

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -4,7 +4,7 @@ base_model:
4
  ---
5
  # EARL - RL Fine-tuned (S + C) thinking (8B)
6
 
7
- **Model Name:** `mair-lab/thinking-sft-simple.rl-simple-n-complex`
8
  **Model Size:** 8B parameters
9
  **Base Checkpoint:** [`mair-lab/sft-think-simple`](https://huggingface.co/mair-lab/sft-think-simple)
10
  **Training Method:** Supervised Fine-Tuning (SFT think (S)) → Reinforcement Learning (RL) on Simple + Complex Edits
 
4
  ---
5
  # EARL - RL Fine-tuned (S + C) thinking (8B)
6
 
7
+ **Model Name:** `mair-lab/earl-thinking-sft-simple.rl-simple-n-complex`
8
  **Model Size:** 8B parameters
9
  **Base Checkpoint:** [`mair-lab/sft-think-simple`](https://huggingface.co/mair-lab/sft-think-simple)
10
  **Training Method:** Supervised Fine-Tuning (SFT think (S)) → Reinforcement Learning (RL) on Simple + Complex Edits