Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Track, rank and evaluate open LLMs and chatbots
Benchmarking driving event classification & visual insights
Evaluate LLMs in different social science experiments
View and submit model evaluations for benchmarks
View and submit evaluations for language models
View and compare model ethical scores
Generate charts comparing model accuracy across different benchmarks
Display the latest submission results for the IEEE Low-Power Computer Vision Challenge leaderboard
View and refresh the IEEE Low-Power Computer Vision Challenge leaderboard
Explore language embedding model performance across datasets
View and submit LLM evaluations
Publish and analyze LLM model performance metrics
Generate Spring Training baseball stats leaderboard
Generate benchmark leaderboard for retrieval models
Fork of lmarena-ai/chatbot-arena-leaderboard
LLM performance on various tasks in Latvian
Display chatbot leaderboard data