Hanbin Wang

hanbin

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model 14 days ago
PRIME-RL/Eurus-2-7B-PRIME-Zero
published a model 14 days ago
PRIME-RL/Eurus-2-7B-PRIME-Zero
new activity about 1 month ago
PRIME-RL/Eurus-2-7B-PRIME:real usage query

Organizations

OpenBMB, PRIME

hanbin's activity

New activity in PRIME-RL/Eurus-2-7B-PRIME about 1 month ago

real usage query

1
#4 opened about 1 month ago by
asidaddy
updated a Space about 2 months ago
New activity in PRIME-RL/Eurus-2-RL-Data about 2 months ago
New activity in PRIME-RL/Eurus-2-7B-PRIME 3 months ago

Evaluation

6
#1 opened 3 months ago by
tugstugi
upvoted an article 3 months ago
Article

Process Reinforcement through Implicit Rewards

By ganqu and 1 other
25
published an article 3 months ago
Article

Process Reinforcement through Implicit Rewards

By ganqu and 1 other
25