Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
2
Qingping Yang
qingping95
Follow
0 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning
upvoted
a
paper
2 days ago
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning
authored
a paper
about 2 months ago
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
View all activity
Organizations
Papers
2
arxiv:
2505.11896
arxiv:
2503.22230
models
0
None public yet
datasets
0
None public yet