Xinyu Zhu's picture

Xinyu Zhu

TianHongZXY

·

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

updated a model 13 days ago

meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval

published a model 22 days ago

meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval

liked a dataset about 1 month ago

Xnhyacinth/LongBench

View all activity

Organizations

Collections 2

Papers 13

arxiv:2603.00889

arxiv:2506.01347

arxiv:2506.15710

arxiv:2409.18786

models 12

TianHongZXY/CHIMERA-4B-SFT

4B • Updated Mar 2 • 10 • 2

TianHongZXY/CHIMERA-4B-RL

4B • Updated Mar 2 • 3 • 4

TianHongZXY/Qwen3-4B-NSR

4B • Updated Dec 6, 2025

TianHongZXY/Qwen2.5-Math-7B-GRPO

8B • Updated Jul 28, 2025

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8, 2025

TianHongZXY/Qwen2.5-Math-7B-W-REINFORCE

8B • Updated Jun 1, 2025 • 11 • 1

TianHongZXY/Qwen3-4B-GRPO

4B • Updated May 31, 2025 • 20

TianHongZXY/Qwen3-4B-PPO

4B • Updated May 31, 2025 • 2

TianHongZXY/Qwen3-4B-PSR

4B • Updated May 31, 2025 • 3 • 1

TianHongZXY/Qwen2.5-Math-7B-PPO

8B • Updated May 31, 2025 • 2

datasets 6

TianHongZXY/CHIMERA

Viewer • Updated Apr 17 • 9.23k • 641 • 21

TianHongZXY/aime-1983-2025

Viewer • Updated Apr 16, 2025 • 963 • 163

TianHongZXY/AIME2025

Viewer • Updated Mar 22, 2025 • 30 • 473 • 1

TianHongZXY/AIME2024

Viewer • Updated Mar 22, 2025 • 30 • 220

TianHongZXY/amc23

Viewer • Updated Mar 22, 2025 • 40 • 485

TianHongZXY/MATH

Viewer • Updated Jan 12, 2025 • 12.5k • 536 • 3