chenggong1995/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-grpo-beta0-epoch2 Text Generation • Updated 5 days ago
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW Text Generation • Updated 5 days ago • 594
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP Text Generation • Updated about 21 hours ago • 131
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP-v2 Text Generation • Updated about 1 hour ago