Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
11
12
15
LLM-Leaderboard
StarscreamDeceptions
Follow
thomwolf's profile picture
21world's profile picture
binwang's profile picture
4 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
upvoted
a
paper
3 days ago
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
updated
a Space
7 days ago
AIDC-AI/Marco-MT-Algharb
View all activity
Organizations
spaces
1
pinned
Running
22
๐ Multilingual MMLU Benchmark Leaderboard
๐
View and submit LLM benchmarks
models
0
None public yet
datasets
2
Sort:ย Recently updated
StarscreamDeceptions/results
Viewer
โข
Updated
Nov 13, 2024
โข
17
โข
9
StarscreamDeceptions/requests
Preview
โข
Updated
Nov 13, 2024
โข
17