Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
6
5
garyzhang
xiaoniqiu
Follow
0 followers
ยท
5 following
garyzhang99
AI & ML interests
LLM, Agents
Recent Activity
commented
on
a paper
about 9 hours ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
commented
on
a paper
1 day ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
authored
a paper
2 days ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
View all activity
Organizations
None yet
Papers
1
arxiv:
2508.11408
models
0
None public yet
datasets
0
None public yet