open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/open-thoughts/OpenThinker3-7B/main/lcb/results_2025-06-07T07-04-08.714563.json with huggingface_hub
3912fd8
verified

lewtun HF Staff committed on

Upload eval_results/open-thoughts/OpenThinker3-7B/main/lcb_v4/results_2025-06-07T01-17-35.943558.json with huggingface_hub
bdddaff
verified

lewtun HF Staff committed on

Upload eval_results/open-thoughts/OpenThinker3-7B/main/aime25/results_2025-06-07T00-31-44.825002.json with huggingface_hub
409523d
verified

lewtun HF Staff committed on

Upload eval_results/open-thoughts/OpenThinker3-7B/main/aime24/results_2025-06-07T00-25-02.008769.json with huggingface_hub
33fc454
verified

lewtun HF Staff committed on

Upload eval_results/open-thoughts/OpenThinker3-7B/main/gpqa/results_2025-06-06T23-22-32.086077.json with huggingface_hub
6c94254
verified

lewtun HF Staff committed on

Upload eval_results/open-thoughts/OpenThinker3-7B/main/math_500/results_2025-06-06T22-58-08.488036.json with huggingface_hub
0cc0f95
verified

lewtun HF Staff committed on

Upload eval_results/HuggingFaceH4/Qwen2.5-1.5B-Instruct-gkd/main/aime24/results_2025-06-05T15-21-44.031115.json with huggingface_hub
22e132f
verified

lewtun HF Staff committed on

Upload eval_results/HuggingFaceH4/Qwen2.5-1.5B-Instruct-gkd/main/gpqa/results_2025-06-05T14-19-10.563460.json with huggingface_hub
c9b3b65
verified

lewtun HF Staff committed on

Upload eval_results/HuggingFaceH4/Qwen2.5-1.5B-Instruct-gkd/main/math_500/results_2025-06-05T14-14-08.048573.json with huggingface_hub
bd6c846
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2.5-1.5B-Instruct/main/aime24/results_2025-06-05T13-36-11.923818.json with huggingface_hub
00e4198
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2.5-1.5B-Instruct/main/math_500/results_2025-06-05T13-22-48.647883.json with huggingface_hub
99f5f9f
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2.5-1.5B-Instruct/main/gpqa/results_2025-06-05T13-21-00.794335.json with huggingface_hub
82ebbd0
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000013650/lcb_v4/results_2025-06-03T02-05-06.521271.json with huggingface_hub
5ee04cd
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000013650/aime24/results_2025-06-03T01-34-00.767451.json with huggingface_hub
099cdad
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000013650/gpqa/results_2025-06-03T00-39-28.428001.json with huggingface_hub
631960a
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.11-step-000000100/aime24/results_2025-06-02T02-27-19.563440.json with huggingface_hub
7668eb1
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.10-step-000000100/aime24/results_2025-06-02T02-26-22.513221.json with huggingface_hub
493451c
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.09-step-000000200/aime24/results_2025-06-01T22-45-13.967295.json with huggingface_hub
d6dd2c0
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.07-step-000000100/aime24/results_2025-06-01T18-03-55.184225.json with huggingface_hub
f793eaa
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.09-step-000000100/aime24/results_2025-06-01T17-41-07.321299.json with huggingface_hub
aeea3de
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.00-step-000001200/aime24/results_2025-05-31T18-52-03.952787.json with huggingface_hub
bca3be7
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.03-step-000001200/aime24/results_2025-05-31T17-53-49.636195.json with huggingface_hub
395daf7
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.01-step-000000700/aime24/results_2025-05-31T15-03-13.035591.json with huggingface_hub
7a8b9e6
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000010920/lcb_v4/results_2025-05-31T14-27-08.682069.json with huggingface_hub
373cd02
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000010920/aime24/results_2025-05-31T14-06-03.782222.json with huggingface_hub
04011e6
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.04-step-000000700/aime24/results_2025-05-31T13-17-59.875372.json with huggingface_hub
9bdc5d9
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.00-step-000001100/aime24/results_2025-05-31T13-04-59.979063.json with huggingface_hub
34b66be
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000010920/gpqa/results_2025-05-31T13-04-31.943339.json with huggingface_hub
453d81a
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.05-step-000000500/aime24/results_2025-05-31T12-46-03.333121.json with huggingface_hub
9b38f56
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.03-step-000001100/aime24/results_2025-05-31T12-11-39.813760.json with huggingface_hub
26438f8
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.01-step-000000600/aime24/results_2025-05-31T09-41-21.448999.json with huggingface_hub
c538887
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.03-step-000001000/aime24/results_2025-05-31T09-41-14.547872.json with huggingface_hub
9e9f846
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.04-step-000000600/aime24/results_2025-05-31T09-35-59.175649.json with huggingface_hub
bb6e331
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.00-step-000001000/aime24/results_2025-05-31T09-35-04.340200.json with huggingface_hub
00405d2
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.02-step-000000400/aime24/results_2025-05-31T03-12-26.130348.json with huggingface_hub
3a730d5
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.00-step-000000900/aime24/results_2025-05-31T01-43-22.498967.json with huggingface_hub
27698e1
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000008190/lcb_v4/results_2025-05-30T22-31-53.129801.json with huggingface_hub
b9fd864
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000008190/aime24/results_2025-05-30T22-02-42.234696.json with huggingface_hub
1cacff6
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-3B/v00.00-step-000008190/gpqa/results_2025-05-30T21-02-57.029717.json with huggingface_hub
ceeb63c
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.01-step-000000500/aime24/results_2025-05-30T20-48-21.579698.json with huggingface_hub
ee7d44a
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.00-step-000000800/aime24/results_2025-05-30T19-59-50.796955.json with huggingface_hub
6efaa3f
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.04-step-000000500/aime24/results_2025-05-30T19-54-06.740290.json with huggingface_hub
da00fd7
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.03-step-000000800/aime24/results_2025-05-30T19-16-01.214600.json with huggingface_hub
3b5e65a
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.00-step-000000700/aime24/results_2025-05-30T14-13-27.258331.json with huggingface_hub
c0fcf69
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.03-step-000000700/aime24/results_2025-05-30T13-37-13.857215.json with huggingface_hub
18bf45f
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.02-step-000000300/aime24/results_2025-05-30T13-20-44.875078.json with huggingface_hub
7f07649
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.05-step-000000300/aime24/results_2025-05-30T11-44-57.556871.json with huggingface_hub
d65452d
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.01-step-000000400/aime24/results_2025-05-30T11-18-54.853969.json with huggingface_hub
cbc4abd
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.04-step-000000400/aime24/results_2025-05-30T10-18-04.224739.json with huggingface_hub
b4df729
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO/v12.00-step-000000600/aime24/results_2025-05-30T08-27-16.276893.json with huggingface_hub
0dfd4fe
verified

edbeeching HF Staff committed on