Wang's picture

2 14

Wang

VincentWang

·

VincentWong1

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

liked a dataset 14 days ago

EricLu/SCP-116K

liked a dataset 15 days ago

a-m-team/AM-DeepSeek-Distilled-40M

View all activity

Organizations

None yet

VincentWang's activity

upvoted an article 29 days ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By

•

Feb 11

• 41

upvoted a paper 9 months ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 49