LLM-Leaderboard
StarscreamDeceptions
4 followers · 1 following
AI & ML interests
None yet
Recent Activity
upvoted a paper, 17 days ago
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
upvoted a paper, 17 days ago
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
updated a Space, 21 days ago
AIDC-AI/Marco-MT-Algharb
Organizations
StarscreamDeceptions's Spaces (1)
pinned
Running
22
Multilingual MMLU Benchmark Leaderboard
View and submit LLM benchmarks