chenggong1995/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-epoch3 Text Generation • 8B • Updated Apr 13 • 8
chenggong1995/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-grpo-epoch3 Text Generation • 8B • Updated Apr 12 • 6
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-epoch1 Text Generation • 8B • Updated Apr 24 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt0 Text Generation • 8B • Updated Apr 26 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt1 Text Generation • 8B • Updated Apr 26 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold30-3Dhint-prompt1 Text Generation • 8B • Updated Apr 28 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold10-hint0.5-prompt1-dp Text Generation • 8B • Updated Apr 29 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-hint0.5-prompt0-epoch4 Text Generation • 8B • Updated May 4 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold20-2Dhint-prompt1-epoch4-new Text Generation • 8B • Updated May 5 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold20-2Dhint-prompt1-GPU71 Text Generation • 8B • Updated May 6 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-mix-grpo-CL-beta1e-3-epoch1-v2 Text Generation • 8B • Updated May 14 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-hint50-prompt1-redonum-test Text Generation • 8B • Updated May 16 • 5
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt1-epoch5-new44 Text Generation • 8B • Updated May 21 • 6
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-beta0-epoch5-new44 Text Generation • 8B • Updated May 20 • 6