Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
josefonte
's Collections
Benchmarks
Benchmarks
updated
Mar 28
collection of datasets used to train and test MLMMs (VLMs)
Upvote
-
AI4Math/MathVerse
Viewer
•
Updated
May 15
•
4.73k
•
1.17k
•
58
MMMU/MMMU
Viewer
•
Updated
Sep 19, 2024
•
11.6k
•
33.5k
•
271
MMMU/MMMU_Pro
Viewer
•
Updated
Mar 8
•
5.19k
•
4.38k
•
29
AI4Math/MathVista
Viewer
•
Updated
Feb 11, 2024
•
6.14k
•
10.9k
•
162
MathLLMs/MathVision
Viewer
•
Updated
May 16
•
3.34k
•
13.3k
•
71
TIGER-Lab/MEGA-Bench
Viewer
•
Updated
May 7
•
7.69k
•
1.63k
•
21
lmms-lab/MMBench_EN
Viewer
•
Updated
Mar 8, 2024
•
11.1k
•
602
•
4
Lin-Chen/MMStar
Viewer
•
Updated
Apr 7, 2024
•
1.5k
•
9.37k
•
36
lmms-lab/MME
Viewer
•
Updated
Dec 23, 2023
•
2.37k
•
16.6k
•
21
MUIRBENCH/MUIRBENCH
Viewer
•
Updated
Jul 1, 2024
•
2.6k
•
1.78k
•
15
BLINK-Benchmark/BLINK
Viewer
•
Updated
Aug 13, 2024
•
3.81k
•
3.5k
•
27
OpenGVLab/CRPE
Viewer
•
Updated
Mar 21, 2024
•
544
•
208
•
9
ByteDance/MTVQA
Viewer
•
Updated
May 30, 2024
•
8.79k
•
267
•
34
lmms-lab/RealWorldQA
Viewer
•
Updated
Apr 13, 2024
•
765
•
5.93k
•
5
yifanzhang114/MME-RealWorld
Preview
•
Updated
Nov 14, 2024
•
1.34k
•
17
lmms-lab/MMVet
Viewer
•
Updated
Mar 8, 2024
•
218
•
2.03k
•
4
mistralai/MM-MT-Bench
Viewer
•
Updated
Oct 10, 2024
•
92
•
563
•
22
edinburgh-dawg/mmlu-redux
Viewer
•
Updated
Feb 9
•
3k
•
6.88k
•
32
TIGER-Lab/MMLU-Pro
Viewer
•
Updated
Apr 6
•
12.1k
•
49.4k
•
360
Idavidrein/gpqa
Viewer
•
Updated
Mar 28, 2024
•
1.25k
•
40.7k
•
188
openai/gsm8k
Viewer
•
Updated
Jan 4, 2024
•
17.6k
•
399k
•
799
openai/openai_humaneval
Viewer
•
Updated
Jan 4, 2024
•
164
•
58.5k
•
323
nuprl/MultiPL-E
Viewer
•
Updated
Feb 10
•
12.7k
•
9.65k
•
53
google/IFEval
Viewer
•
Updated
Aug 14, 2024
•
541
•
19.3k
•
72
opendatalab/OmniDocBench
Viewer
•
Updated
Feb 11
•
984
•
2.82k
•
26
wulipc/CC-OCR
Viewer
•
Updated
Dec 27, 2024
•
7.06k
•
658
•
1
lmms-lab/ai2d
Viewer
•
Updated
Mar 26, 2024
•
3.09k
•
8.35k
•
11
lmms-lab/textvqa
Viewer
•
Updated
Mar 8, 2024
•
45.3k
•
14.7k
•
11
lmms-lab/DocVQA
Viewer
•
Updated
Apr 18, 2024
•
16.6k
•
11.9k
•
44
HuggingFaceM4/ChartQA
Viewer
•
Updated
Mar 5, 2024
•
32.7k
•
6.51k
•
43
princeton-nlp/CharXiv
Viewer
•
Updated
Sep 27, 2024
•
2.32k
•
967
•
39
AILab-CVC/SEED-Bench-2-plus
Viewer
•
Updated
Apr 27, 2024
•
555
•
74
•
5
echo840/OCRBench
Viewer
•
Updated
Dec 18, 2024
•
1k
•
12.1k
•
15
lmms-lab/OCRBench-v2
Viewer
•
Updated
Feb 9
•
10k
•
442
•
5
Upvote
-
Share collection
View history
Collection guide
Browse collections