LLM-Leaderboard's picture

LLM-Leaderboard

StarscreamDeceptions

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking

upvoted a paper 17 days ago

HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application

updated a Space 21 days ago

AIDC-AI/Marco-MT-Algharb

View all activity

Organizations

StarscreamDeceptions 's Spaces 1

🌐 Multilingual MMLU Benchmark Leaderboard

View and submit LLM benchmarks