The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...
- allenai/Olmo-3.1-32B-Think
  Text Generation • 32B • Updated • 4.49k • 66
- allenai/Olmo-3.1-32B-Instruct-SFT
  32B • Updated • 2.12k • 6
- allenai/Olmo-3.1-32B-Instruct-DPO
  Text Generation • 32B • Updated • 661 • 4
- allenai/Olmo-3.1-32B-Instruct
  Text Generation • 32B • Updated • 9.6k • 46