Hugging Face

Ganqu Cui

ganqu
  • cgq15

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
authored a paper 7 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
upvoted a paper 7 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Organizations

OpenBMB, PRIME

Articles 1

Article
27

Process Reinforcement through Implicit Rewards

Papers 16

arxiv:2505.22617
arxiv:2504.16084
arxiv:2504.14945
arxiv:2503.21614

Models 0

None public yet

Datasets 1

ganqu/openbackdoor

Preview • Updated Oct 23, 2024 • 79