Chatbot Arena Leaderboard
Display chatbot leaderboard and stats
Display chatbot leaderboard and stats
Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Request evaluation for a speech model
Explore LLM performance across hardware
Search and submit code models for evaluation
Can AI Code? An LLM leaderboard inclquantized models.
View and submit LLM evaluations
View and submit machine learning model evaluations
Analyze images to detect and label objects
Evaluate LLM cybersecurity risks
View LLM Performance Leaderboard
Explore and compare QA and long doc benchmarks
VLMEvalKit Evaluation Results Collection
Display and filter model evaluation results
Explore and analyze code evaluation data
Display and filter multimodal model leaderboard results
Display text-to-text translation interface
Visualize Open vs. Proprietary LLM Progress
Vote on AI responses to rank models
Blind vote on HF TTS models!
A leaderboard for LLMs powering smolagents