Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published 6 days ago • 236
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 211
Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It? By Kseniase • Mar 17 • 328
Article I trained a Language Model to schedule events with GRPO! By anakin87 • Apr 29 • 86
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16 • 263
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 516
The Ultra-Scale Playbook: The ultimate guide to training LLMs on large GPU clusters • Running • 3.12k