Andy Andurkar

AndyAndurkar

AI & ML interests

None yet

Recent Activity

upvoted an article about 9 hours ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

liked a Space 30 days ago

Vchitect/VBench_Leaderboard

upvoted an article about 1 month ago

Vision Language Models Explained


Organizations

None yet

upvoted an article about 9 hours ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022 • 389

upvoted an article about 1 month ago

Article

Vision Language Models Explained

Apr 11, 2024 • 505

upvoted 4 articles 6 months ago

Article

🦸🏻#11: How Do Agents Plan and Reason?

Feb 24, 2025

Article

🦸🏻#10: Does Present-Day GenAI Actually Reason?

Feb 15, 2025

Article

Everything You Need to Know about Knowledge Distillation

Mar 6, 2025

Article

Inside the family of Smol models

Feb 27, 2025

upvoted an article 11 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025 • 267
