Ganqu Cui
ganqu
19 followers · 2 following
cgq15
AI & ML interests
None yet
Recent Activity
Upvoted a paper about 2 hours ago: Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
Authored a paper 7 days ago: The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Upvoted a paper 7 days ago: The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Organizations
Articles
1
Article
27
Process Reinforcement through Implicit Rewards
Papers
16
arxiv:2505.22617
arxiv:2504.16084
arxiv:2504.14945
arxiv:2503.21614
models
0
None public yet
datasets
1
ganqu/openbackdoor • Updated Oct 23, 2024 • 79