AI & ML interests
Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/
Collections

- We collect open-source datasets and process them into a standard format.
- We train reward models via maximum likelihood estimation under the Bradley-Terry model (a loss sketch follows this list).
- Datasets, code, and models for online RLHF (i.e., iterative DPO); a loss sketch closes this section.
- A series of SFT models trained on RLHFlow's high-quality SFT dataset for research purposes.
- The online-DPO-R1 project.
- Datasets and models for process reward modeling.
- The mixture of preference datasets used for reward modeling.
- Materials for training pairwise preference models.
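A minimal sketch of the Bradley-Terry objective referenced above: the reward model assigns a scalar score to each response, and training maximizes log sigmoid(r_chosen - r_rejected) over preference pairs. The function and tensor names below are ours for illustration, not RLHFlow's.

```python
import torch
import torch.nn.functional as F

def bradley_terry_loss(chosen_rewards: torch.Tensor,
                       rejected_rewards: torch.Tensor) -> torch.Tensor:
    """Negative log-likelihood of the Bradley-Terry preference model.

    Both inputs have shape (batch,): scalar scores r(x, y) from the
    reward model for the preferred and rejected response of each pair.
    """
    # P(chosen preferred over rejected) = sigmoid(r_chosen - r_rejected),
    # so maximum likelihood estimation minimizes the negative
    # log-sigmoid of the score difference.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```

Minimizing this loss over a preference dataset is exactly the maximum likelihood estimation mentioned in the collection description.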
Reward models trained with the RLHFlow codebase (https://github.com/RLHFlow/RLHF-Reward-Modeling/); a usage sketch follows the list:

- RLHFlow/ArmoRM-Llama3-8B-v0.1 (Text Classification, 8B, 19.6k downloads, 179 likes)
- RLHFlow/pair-preference-model-LLaMA3-8B (Text Generation, 8B, 2.09k downloads, 38 likes)
- sfairXC/FsfairX-LLaMA3-RM-v0.1 (Text Classification, 8B, 2.13k downloads, 59 likes)
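A minimal sketch of scoring a conversation with one of the reward models above, assuming the checkpoint loads as a standard transformers sequence-classification head. Check each model card for the exact recipe; ArmoRM, for instance, requires trust_remote_code=True and returns multi-objective scores rather than a single logit.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "sfairXC/FsfairX-LLaMA3-RM-v0.1"  # or another RM listed above
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

chat = [
    {"role": "user", "content": "What is RLHF?"},
    {"role": "assistant",
     "content": "RLHF fine-tunes a language model on human preference data."},
]
# Render the conversation with the model's chat template, then score it;
# the scalar logit serves as the reward for the assistant response.
input_ids = tokenizer.apply_chat_template(chat, return_tensors="pt").to(model.device)
with torch.no_grad():
    reward = model(input_ids).logits[0][0].item()
print(f"reward: {reward:.3f}")
```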
Paper: RLHF Workflow: From Reward Modeling to Online RLHF (arXiv:2405.07863)
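For the online (iterative) DPO recipe from the paper above, each round generates fresh responses, labels pairs with the reward model, and runs DPO updates on them. A minimal sketch of the per-pair DPO loss, with all names ours; each input is a (batch,) tensor of response log-probabilities summed over tokens, under the current policy and a frozen reference model.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp: torch.Tensor,
             policy_rejected_logp: torch.Tensor,
             ref_chosen_logp: torch.Tensor,
             ref_rejected_logp: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Direct Preference Optimization loss for a batch of preference pairs."""
    # Implicit reward margin: how much more the policy (relative to the
    # reference model) prefers the chosen response over the rejected one.
    margin = (policy_chosen_logp - ref_chosen_logp) - (
        policy_rejected_logp - ref_rejected_logp
    )
    return -F.logsigmoid(beta * margin).mean()
```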