Article: Illustrating Reinforcement Learning from Human Feedback (RLHF), by natolambert and 3 others (Dec 9, 2022)
Collection: LLM Reasoning Papers, papers on improving the reasoning capabilities of LLMs (45 items, updated Feb 18)