arxiv:2501.03262
Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
liked a model 2 days ago
CohereForAI/c4ai-command-r7b-12-2024
upvoted a paper 10 days ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
commented on a paper 10 days ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models