AI2 WildBench Leaderboard (V2)
Display and explore model leaderboards and chat history
Display chatbot arena leaderboard and statistics
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
Determine GPU requirements for large language models
Identify key entities in text
Browse and filter leaderboard of language models
Generate text from document images
Analyze document layout from images
Extract and recognize text from documents
Answer questions about images by chatting
Efficient quantized retrieval over Wikipedia
Display and filter reward model evaluation data
Identify objects in images based on text descriptions
Analyze images to detect and label objects
VLMEvalKit Evaluation Results Collection
Run a Streamlit web app
Visualize Open vs. Proprietary LLM Progress
Upload a PDF and ask questions to get insights
Submit and evaluate AI models on a leaderboard
Identify and highlight key entities in text
Explore and analyze code evaluation data
Create a Hugging Face dataset from text files
Generate speech from text in multiple languages
Analyze images to generate captions, detect objects, or perform OCR
Generate React TypeScript App
Video captioning/tracking
Display visual document retrieval leaderboard
In-browser speech recognition w/ word-level timestamps
Generate insights from charts using text prompts
Need to analyze data? Let a Llama-3.1 agent do it for you!
Launch MTEB Arena to compare models
View and submit language model evaluations
Detect objects in images using text prompts
VLMEvalKit Eval Results in video understanding benchmark
Extract text from images using various OCR modes
Display and filter leaderboard results for LLM judges
Remove background from any image
Vote on AI responses to rank models
What happened in open-source AI this year, and what's next?
Generate interactive React app data visualizations
Detect and annotate poses in images and videos
Generate code solutions interactively
Ranking of LLMs for agentic tasks
OmniParser: turn your LLM into a GUI agent
Enhance low-light images to improve clarity
PDF to Structured Data powered by Google DeepMind Gemini 2.0
Handwritten Signature Detection
Generate document text from images using prompts
Generate text and speech responses from text, images, or audio input
Detect faces in uploaded images
Convert PDFs to Markdown with open-source parsers