One RL to See Them All: Visual Triple Unified Reinforcement Learning Paper • 2505.18129 • Published 13 days ago • 59 • 2
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 293 • 6
Scaling Laws for Linear Complexity Language Models Paper • 2406.16690 • Published Jun 24, 2024 • 23 • 4