Hugging Face
yuchang
hiyuchang
AI & ML interests
None yet
Recent Activity
Upvoted a paper about 13 hours ago: On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
Upvoted a paper 3 months ago: Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
Upvoted an article 6 months ago: DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
Organizations
None yet
hiyuchang's models
None public yet