Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 4 days ago • 479
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 475
Article Enabling Long Context Training with Sequence Parallelism in Axolotl By axolotl-ai-co and 1 other • Apr 4 • 10
Article Open-Source Handwritten Signature Detection Model By samuellimabraz • Mar 14 • 114
Running 2.8k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters
HF Deep RL Course Collection Models cooked in HF Deep RL Course (https://huggingface.co/learn/deep-rl-course) • 1 item • Updated Feb 7
Running 573 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute