Spaces:

OctoThinker
/

README

Running

koalazf99 commited on 14 days ago

Commit

b1d78ad

verified ·

1 Parent(s): 1dff1a7

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -7,4 +7,5 @@ sdk: static
 pinned: false
 ---
-Edit this `README.md` markdown file to author your organization card.

 pinned: false
 ---
+🐙 OctoThinker, led by [GAIR](https://huggingface.co/GAIR), is an initiative to explore earlier training interventions that make base models more amenable to reinforcement learning (RL) scaling.
+🎯 Our Goal: To reshape the pre-training trajectory so models scale better under RL.