Running 1.21k 1.21k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes tiny, tiny2, small, base, large and large2 variants. • 8 items • Updated Jan 17 • 16
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated Jan 17 • 39
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • 14 days ago • 40
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 735