Yuchen Fan

yuchenFan

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

upvoted a paper about 1 month ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

updated a dataset about 2 months ago

yuchenFan/Search-R1

View all activity

Organizations

authored a paper about 1 month ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 124

upvoted a paper about 1 month ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 124

updated a dataset about 2 months ago

yuchenFan/Search-R1

Updated May 17 • 7

published a dataset about 2 months ago

yuchenFan/Search-R1

Updated May 17 • 7

liked a Space about 2 months ago

215

LLM训练终极指南 | The Ultra-Scale Playbook

🔥

了解LLM训练的方方面面

upvoted 2 papers 3 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 117

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28 • 39

authored a paper 4 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 28

upvoted a paper 4 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 28

updated a model 4 months ago

yuchenFan/EDIT-SFT

8B • Updated Feb 26 • 6

published a model 4 months ago

yuchenFan/EDIT-SFT

8B • Updated Feb 26 • 6

updated a model 5 months ago

yuchenFan/Difficulty-Classifier-qwen-7b-inst

8B • Updated Feb 23 • 7

published a model 5 months ago

yuchenFan/Difficulty-Classifier-qwen-7b-inst

8B • Updated Feb 23 • 7

authored a paper 5 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

upvoted 2 papers 5 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 22

updated 2 models 6 months ago

PRIME-RL/EurusPRM-Stage2

8B • Updated Feb 19 • 1.69k • 7

PRIME-RL/EurusPRM-Stage1

8B • Updated Feb 19 • 1.73k • 4

updated a dataset 6 months ago

PRIME-RL/Eurus-2-Rollout

Viewer • Updated Jan 13 • 300k • 14 • 2

liked a model 6 months ago

PRIME-RL/EurusPRM-Stage2

8B • Updated Feb 19 • 1.69k • 7

Yuchen Fan

AI & ML interests

Recent Activity

Organizations

yuchenFan's activity

LLM训练终极指南 | The Ultra-Scale Playbook