Running on CPU Upgrade 366 366 GAIA Leaderboard ๐ฆพ Submit models for evaluation and view leaderboard
Running on CPU Upgrade 12.8k 12.8k Open LLM Leaderboard ๐ Track, rank and evaluate open LLMs and chatbots
Running 92 92 Nexus Function Calling Leaderboard ๐ Visualize model performance on function calling tasks