Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
23
39
Anirudh Thatipelli
Anirudh25
Follow
Mi6paulino's profile picture
Theartplug's profile picture
2 followers
·
52 following
https://anirudh257.github.io/
Anirudh257
AI & ML interests
None yet
Recent Activity
liked
a dataset
2 days ago
simplescaling/s1K_tokenized
upvoted
a
collection
2 days ago
Qwen2.5
upvoted
a
paper
2 days ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
View all activity
Organizations
None yet
models
10
Sort: Recently updated
Anirudh25/ppo-Breakout-v4
Updated
Jun 1, 2023
Anirudh25/a2c-Breakout-v4
Updated
Jun 1, 2023
Anirudh25/dqn-Breakout-v4
Updated
Jun 1, 2023
Anirudh25/ppo-PongNoFrameskip-v4
Updated
Jun 1, 2023
Anirudh25/a2c-PongNoFrameskip-v4
Updated
Jun 1, 2023
Anirudh25/dqn-PongNoFrameskip-v4
Updated
Jun 1, 2023
Anirudh25/ppo-SpaceInvadersNoFrameskip-v4
Updated
Jun 1, 2023
Anirudh25/dqn-SpaceInvadersNoFrameskip-v4
Updated
Jun 1, 2023
Anirudh25/a2c-SpaceInvadersNoFrameskip-v4
Updated
May 31, 2023
Anirudh25/ppo-LunarLander-v2-TEST
Reinforcement Learning
•
Updated
Apr 30, 2023
•
1
datasets
0
None public yet