Perusha Moodley's picture

6 8

Perusha Moodley

moodlep

·

https://www.perusha.dev/

AI & ML interests

RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods

Recent Activity

liked a dataset 2 days ago

Anthropic/hh-rlhf

upvoted a paper 14 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

updated a model 21 days ago

moodlep/smollm2-17b-dpo-cai-v1

View all activity

Organizations

moodlep's activity

upvoted a paper 14 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 97

upvoted a collection 30 days ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated 24 days ago • 22

upvoted a collection about 1 month ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 1 day ago • 64

upvoted an article 9 months ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Apr 22, 2024

• 80

upvoted a paper 10 months ago

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 17