AReaL-boba² 🔥 A fully async RL system by Ant Research & Tsinghua.
Paper: AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning (2505.24298)
Model: inclusionAI/areal-boba-2-683f0e819ccb7bb2e1b2f2d5
✨ 8B/14B/32B models, datasets & paper – all on the hub
✨ Up to 2.77× faster training than synchronous RL systems
✨ Native Agentic RL support