Wang

VincentWang

VincentWong1

AI & ML interests

None yet

Organizations

None yet

VincentWang's activity

liked a model 6 days ago

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

Updated Apr 13 • 1.44k • 87

liked a dataset 14 days ago

EricLu/SCP-116K

Viewer • Updated Mar 17 • 182k • 706 • 100

liked a dataset 15 days ago

a-m-team/AM-DeepSeek-Distilled-40M

Viewer • Updated May 10 • 11.5M • 16.6k • 42

liked a dataset 19 days ago

allenai/tulu-3-sft-personas-instruction-following

Viewer • Updated Nov 21, 2024 • 30k • 3.68k • 29

liked a model 26 days ago

TIGER-Lab/general-verifier

Question Answering • Updated Apr 15 • 9.04k • 14

upvoted an article 29 days ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Article • Feb 11 • 41

liked 2 datasets 30 days ago

ucinlp/drop

Viewer • Updated Jan 17, 2024 • 86.9k • 2.66k • 54

deepmind/aqua_rat

Viewer • Updated Jan 9, 2024 • 196k • 3.86k • 63

liked a model 2 months ago

Xenova/text-embedding-ada-002

Updated Mar 27, 2024 • 78

liked a model 4 months ago

Daemontatox/Zireal-0

Text Generation • Updated Mar 4 • 16 • 1

liked a Space 5 months ago

MTEB Leaderboard

🥇 Embedding Leaderboard • 5.82k

liked a dataset 8 months ago

lvwerra/stack-exchange-paired

Viewer • Updated Mar 13, 2023 • 31.3M • 2.5k • 144

upvoted a paper 9 months ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 49

liked a model 10 months ago

Lenovo-Zhihui/Zhihui_LLM_Embedding

Feature Extraction • Updated Jul 1, 2024 • 24 • 15

liked a Space 10 months ago

AIR-Bench Leaderboard

🥇

Explore benchmark results for QA and long doc models

liked a model 11 months ago