DestinyW

AI & ML interests

None yet

Organizations

None yet

DestinyW's activity

upvoted 2 articles 3 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11 • 41

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 151