Space: open-r1/open-r1-eval-leaderboard (duplicated from HuggingFaceH4/lm-eval-leaderboard)
Branch: main
Path: open-r1-eval-leaderboard/eval_results
2 contributors
History: 25583 commits
Latest commit: 22e132f (verified) by lewtun (HF Staff), 1 day ago
Upload eval_results/HuggingFaceH4/Qwen2.5-1.5B-Instruct-gkd/main/aime24/results_2025-06-05T15-21-44.031115.json with huggingface_hub
Name | Last commit message | Last updated
HuggingFaceH4 | Upload eval_results/HuggingFaceH4/Qwen2.5-1.5B-Instruct-gkd/main/aime24/results_2025-06-05T15-21-44.031115.json with huggingface_hub | 1 day ago
HuggingFaceTB | Upload eval_results/HuggingFaceTB/SmolLM2-1.7B-Instruct/main/gsm8k_8k/results_2025-02-13T08-41-05.696800.json with huggingface_hub | 4 months ago
PrimeIntellect | Upload eval_results/PrimeIntellect/SYNTHETIC-1-SFT-7B/main/lcb/results_2025-03-04T13-04-27.679164.json with huggingface_hub | 3 months ago
Qwen | Upload eval_results/Qwen/Qwen2.5-1.5B-Instruct/main/aime24/results_2025-06-05T13-36-11.923818.json with huggingface_hub | 1 day ago
deepseek-ai | Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/lcb/results_2025-05-23T13-49-36.053706.json with huggingface_hub | 14 days ago
lewtun | Upload eval_results/lewtun/Qwen2.5-1.5B-Open-R1-Distill/main/math_500/results_2025-02-17T15-24-15.837145.json with huggingface_hub | 4 months ago
mistralai | Upload eval_results/mistralai/Codestral-22B-v0.1/main/lcb/results_2025-03-05T11-41-53.582539.json with huggingface_hub | 3 months ago
open-r1 | Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000013650/lcb_v4/results_2025-06-03T02-05-06.521271.json with huggingface_hub | 4 days ago
open-thoughts | Upload eval_results/open-thoughts/OpenThinker-7B/main/lcb/results_2025-05-28T13-34-20.139758.json with huggingface_hub | 9 days ago
qgallouedec | Upload eval_results/qgallouedec/R1-Zero-Qwen-7B-Math/v06.02-step-000000511/math_500/results_2025-05-01T15-28-47.000538.json with huggingface_hub | about 1 month ago