-
opencompass/CompassJudger-1-32B-Instruct
Text Generation β’ 33B β’ Updated β’ 56 β’ 16 -
opencompass/CompassJudger-1-14B-Instruct
Text Generation β’ 15B β’ Updated β’ 22 β’ 2 -
opencompass/CompassJudger-1-7B-Instruct
8B β’ Updated β’ 1.78k β’ 9 -
opencompass/CompassJudger-1-1.5B-Instruct
2B β’ Updated β’ 15 β’ 1
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
π join us on Discord and WeChat
follow us on Github
OpenCompass is a platform focused on evaluation of AGI, include Large Language Model and Multi-modality Model. We aim to:
- develop high-quality libraries to reduce the difficulties in evaluation
- provide convincing leaderboards for improving the understanding of the large models
- create powerful toolchains targeting a variety of abilities and tasks
- build solid benchmarks to support the large model research
-
opencompass/CompassJudger-1-32B-Instruct
Text Generation β’ 33B β’ Updated β’ 56 β’ 16 -
opencompass/CompassJudger-1-14B-Instruct
Text Generation β’ 15B β’ Updated β’ 22 β’ 2 -
opencompass/CompassJudger-1-7B-Instruct
8B β’ Updated β’ 1.78k β’ 9 -
opencompass/CompassJudger-1-1.5B-Instruct
2B β’ Updated β’ 15 β’ 1
spaces
14
pinned
Running
26
Openvlm Subjective Leaderboard
π
VLMEvalKit Subjectivce Benchmark Results
pinned
Running
2
CompassAcademic Leaderboard Full Version
π¦
Compass Academic Leaderboard Full Version
pinned
Running
38
Open LMM Reasoning Leaderboard
π₯
A Leaderboard that demonstrates LMM reasoning capabilities
pinned
Running
6
Compass Academic Leaderboard
π¦
Compass Academic Leaderboard
pinned
Running
on
CPU Upgrade
804
Open VLM Leaderboard
π
VLMEvalKit Evaluation Results Collection
pinned
Running
21
MMBench Leaderboard
π
View and filter MMBench leaderboard data
models
8

opencompass/anah-7b
Text Classification
β’
8B
β’
Updated
β’
34

opencompass/anah-20b
Text Classification
β’
20B
β’
Updated
β’
26

opencompass/anah-v2
Text Classification
β’
8B
β’
Updated
β’
102
β’
4

opencompass/CompassJudger-1-14B-Instruct
Text Generation
β’
15B
β’
Updated
β’
22
β’
2

opencompass/CompassJudger-1-32B-Instruct
Text Generation
β’
33B
β’
Updated
β’
56
β’
16

opencompass/CompassJudger-1-1.5B-Instruct
2B
β’
Updated
β’
15
β’
1

opencompass/CompassJudger-1-7B-Instruct
8B
β’
Updated
β’
1.78k
β’
9

opencompass/mixtral-8x7b-32k
Updated
β’
1
datasets
11
opencompass/LiveMathBench
Viewer
β’
Updated
β’
483
β’
1.09k
β’
7
opencompass/NeedleBench
Viewer
β’
Updated
β’
6.8k
β’
2.2k
β’
5
opencompass/compass_academic_predictions
Viewer
β’
Updated
β’
4.42M
β’
11
opencompass/Creation-MMBench
Viewer
β’
Updated
β’
765
β’
176
β’
2
opencompass/anah
Viewer
β’
Updated
β’
783
β’
97
β’
3
opencompass/AIME2025
Viewer
β’
Updated
β’
30
β’
3.93k
β’
21
opencompass/mmmlu_lite
Viewer
β’
Updated
β’
20k
β’
37
β’
2
opencompass/MMBench-Video
Preview
β’
Updated
β’
286
β’
8
opencompass/flames
Viewer
β’
Updated
β’
537
β’
63
opencompass/CriticBench
Updated
β’
343
β’
4