Reinforcement Learning Teachers Collection Students distilled from a 7B Reinforcement-Learned Teacher (RLT), introduced in the paper "Reinforcement Learning Teachers of Test Time Scaling." • 2 items • Updated Jun 22 • 8
Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24 Text Generation • 12B • Updated Oct 25, 2024 • 693k • 126
MiniMax-M1 Collection MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 21 days ago • 110
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 151