20 5 4

Hanbin Wang

hanbin

https://wanghanbinpanda.github.io/

wanghanbinpanda

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model about 2 months ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

published a model about 2 months ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

new activity 3 months ago

PRIME-RL/Eurus-2-7B-PRIME:real usage query

View all activity

Organizations

hanbin's activity

updated a model about 2 months ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

Text Generation • Updated Mar 14 • 101 • 1

published a model about 2 months ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

Text Generation • Updated Mar 14 • 101 • 1

New activity in PRIME-RL/Eurus-2-7B-PRIME 3 months ago

real usage query

#4 opened 3 months ago by

asidaddy

authored a paper 3 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

updated 3 datasets 3 months ago

updated 4 models 3 months ago

PRIME-RL/EurusPRM-Stage2

Updated Feb 19 • 282 • 6

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated Feb 19 • 695 • 60

PRIME-RL/Eurus-2-7B-SFT

Updated Feb 19 • 4.03k • 2

PRIME-RL/EurusPRM-Stage1

Updated Feb 19 • 330 • 4

updated a Space 3 months ago

README

🏃

upvoted a paper 3 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

commented a paper 3 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61 •

New activity in PRIME-RL/Eurus-2-RL-Data 3 months ago

some empty code ground truths (roughly 1k in train)

#3 opened 3 months ago by

rawsh

New activity in PRIME-RL/Eurus-2-7B-PRIME 4 months ago

Evaluation

#1 opened 4 months ago by

tugstugi

Add library_name and pipeline_tag

#2 opened 4 months ago by

nielsr

upvoted an article 4 months ago

Article

Process Reinforcement through Implicit Rewards

and 1 other •

Jan 3

• 27

published an article 4 months ago

Article

Process Reinforcement through Implicit Rewards

and 1 other •

Jan 3

• 27

liked a model 4 months ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated Feb 19 • 695 • 60