Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
20
5
4
Hanbin Wang
hanbin
Follow
ZSKHGA's profile picture
shuyuej's profile picture
junwux's profile picture
13 followers
·
4 following
https://wanghanbinpanda.github.io/
wanghanbinpanda
AI & ML interests
Code Intelligence and LLM Reasoning (Code, Math)
Recent Activity
updated
a model
about 12 hours ago
hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B_oasst1_wildchat
updated
a model
about 13 hours ago
hanbin/Llama-3.1-8B-pes2o-anneal-2.7B_oasst1_wildchat
published
a model
about 19 hours ago
hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B_oasst1_wildchat
View all activity
Organizations
hanbin
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
authored
a paper
6 months ago
Process Reinforcement through Implicit Rewards
Paper
•
2502.01456
•
Published
Feb 3
•
62