Thomas Yap
wooihen
AI & ML interests
machine learning, NLP, computer vision and RL
Organizations
wooihen's activity
view article
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
view article
How to deploy and fine-tune DeepSeek models on AWS
view article
How we leveraged distilabel to create an Argilla 2.0 Chatbot