-
225
MMLU-Pro Leaderboard
๐ฅMore advanced and challenging multi-task evaluation
-
51
Stick To Your Role! Leaderboard
๐ญBenchmarking LLMs on the stability of simulated populations
-
53
ZeroEval Leaderboard
๐Embed and use ZeroEval for evaluation tasks
-
26
Decentralized Arena Leaderboard
๐ฅDisplay model leaderboard evaluations
Hristo Panev
hppdqdq
AI & ML interests
None yet
Recent Activity
liked
a model
about 23 hours ago
city96/umt5-xxl-encoder-gguf
liked
a model
about 23 hours ago
wikeeyang/Magic-Wan-Image-v1.0
liked
a model
about 23 hours ago
befox/WAN2.2-14B-Rapid-AllInOne-GGUF
Organizations
None yet