Running 2.36k 2.36k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
Running 540 540 Scaling test-time compute π Enhance math problem solving by scaling test-time compute
Running on CPU Upgrade 68 68 AIR-Bench Leaderboard π₯ Explore benchmark results for QA and long doc models
Running 116 116 Open-LLM performances are plateauing, letβs make the leaderboard steep again π Update leaderboard for fair model evaluation
Running 224 224 AI2 WildBench Leaderboard (V2) π¦ Display and explore model leaderboards and chat history
Running 896 896 FineWeb: decanting the web for the finest text data at scale π· Generate high-quality web text data for LLM training