arxiv:2411.00369
Anish Pahilajani
Anish13
·
AI & ML interests
None yet
Organizations
models
31
Anish13/orpheus-3b-0.1-ft-q4f16_1-MLC
Updated
Anish13/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Anish13/ppo-LunarLander
Reinforcement Learning
•
Updated
Anish13/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
31
Anish13/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Anish13/ppo-Pyramids
Reinforcement Learning
•
Updated
Anish13/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Anish13/Reinforce-cartpole_policy
Reinforcement Learning
•
Updated
Anish13/Reinforce-Pixelcopter-PLE-v0_1
Reinforcement Learning
•
Updated
Anish13/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
1
datasets
0
None public yet