Hanze Dong's picture

Hanze Dong

hendrydong

·

https://hendrydong.github.io

hendrydong

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Self-Hinting Language Models Enhance Reinforcement Learning

upvoted a paper about 1 month ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

upvoted a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

View all activity

Organizations

New activity in RLHFlow/LLaMA3.2-1B-SFT about 1 year ago

the training data for this model?

#1 opened about 1 year ago by

New activity in sfairXC/FsfairX-LLaMA3-RM-v0.1 over 1 year ago

Update README.md

#4 opened almost 2 years ago by

Update README.md

#6 opened over 1 year ago by

New activity in sfairXC/FsfairX-LLaMA3-RM-v0.1 almost 2 years ago

Training details?

#2 opened almost 2 years ago by

New activity in microsoft/phi-2 about 2 years ago

How to Train model with AutoModelForSequenceClassification?

#20 opened about 2 years ago by