The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
updated
a model
7 days ago
TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28
published
a model
7 days ago
TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28
updated
a model
11 days ago
TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28