Open-LLM performances are plateauing, let's make the leaderboard steep again
Explore and compare advanced language models on a new leaderboard