Hanze Dong
hendrydong
AI & ML interests
None yet
Recent Activity
upvoted a paper 11 days ago
Self-Hinting Language Models Enhance Reinforcement Learning
upvoted a paper about 1 month ago
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation
upvoted a paper about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization