DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 7 days ago • 104
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 65
view article Article Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 110
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 14 days ago • 330
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 80