Update README.md
Browse files
README.md
CHANGED
|
@@ -4,7 +4,7 @@ base_model:
|
|
| 4 |
---
|
| 5 |
# EARL - RL Fine-tuned (S + C) thinking (8B)
|
| 6 |
|
| 7 |
-
**Model Name:** `mair-lab/thinking-sft-simple.rl-simple-n-complex`
|
| 8 |
**Model Size:** 8B parameters
|
| 9 |
**Base Checkpoint:** [`mair-lab/sft-think-simple`](https://huggingface.co/mair-lab/sft-think-simple)
|
| 10 |
**Training Method:** Supervised Fine-Tuning (SFT think (S)) → Reinforcement Learning (RL) on Simple + Complex Edits
|
|
|
|
| 4 |
---
|
| 5 |
# EARL - RL Fine-tuned (S + C) thinking (8B)
|
| 6 |
|
| 7 |
+
**Model Name:** `mair-lab/earl-thinking-sft-simple.rl-simple-n-complex`
|
| 8 |
**Model Size:** 8B parameters
|
| 9 |
**Base Checkpoint:** [`mair-lab/sft-think-simple`](https://huggingface.co/mair-lab/sft-think-simple)
|
| 10 |
**Training Method:** Supervised Fine-Tuning (SFT think (S)) → Reinforcement Learning (RL) on Simple + Complex Edits
|