dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy • Updated May 18 • 12k • 63
dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy • Updated Apr 29 • 1k • 62
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy • Updated Apr 29 • 500 • 66
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 • Updated Apr 21 • 500 • 34
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_8192 • Updated Apr 19 • 12k • 40
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 • Updated Apr 19 • 12k • 47
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768 • Updated Apr 18 • 12k • 76