Running 514 514 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
Running on CPU Upgrade 68 68 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain.