Hugging Face
Shusheng Xu
xssstory
1 follower · 1 following
xsssotry
AI & ML interests
None yet
Recent Activity
authored a paper, 9 days ago
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
authored a paper, 9 days ago
On Designing Effective RL Reward at Training Time for LLM Reasoning
liked a dataset, 10 days ago
inclusionAI/AReaL-boba-Data
Organizations
Papers: 2
arxiv:2410.15115
arxiv:2404.10719
models
None public yet
datasets
None public yet