Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
Haokai Zhao
jz666
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
6 days ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
upvoted
a
paper
8 days ago
Multiplayer Nash Preference Optimization
updated
a model
9 days ago
jz666/simpo
View all activity
Organizations
None yet
models
1
jz666/simpo
Text Generation
•
9B
•
Updated
9 days ago
•
10
datasets
0
None public yet