Anish13
·
AI & ML interests
None yet
Organizations
Anish13/orpheus-3b-0.1-ft-q4f16_1-MLC
Updated
Anish13/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Anish13/ppo-LunarLander
Reinforcement Learning
•
Updated
Anish13/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
39
Anish13/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Anish13/ppo-Pyramids
Reinforcement Learning
•
Updated
•
1
Anish13/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Anish13/Reinforce-cartpole_policy
Reinforcement Learning
•
Updated
Anish13/Reinforce-Pixelcopter-PLE-v0_1
Reinforcement Learning
•
Updated
Anish13/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
1
Anish13/Taxi-v3
Reinforcement Learning
•
Updated
Anish13/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Anish13/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
2
Anish13/dev_results_model8_new
62.8M
•
Updated
Anish13/dev_results_model9
Updated
Anish13/results_model8_new
63.2M
•
Updated
•
2
Anish13/results_scratch
62.3M
•
Updated
Anish13/results_sratch
62.3M
•
Updated
•
1
Anish13/results_model8
63.3M
•
Updated
•
1
Anish13/results_model5
62.8M
•
Updated
•
5
Anish13/checkpoint-151000
62.6M
•
Updated
•
28
Anish13/results_model6_2
62.6M
•
Updated
•
2
Anish13/results_model6
0.3B
•
Updated
•
1
Anish13/results_model5_gpu1
Updated
Anish13/results_model3
52.8M
•
Updated
•
2
Anish13/results
63.3M
•
Updated
Anish13/results_model4_small
Updated
Anish13/junk
75.9M
•
Updated
Anish13/part3
Updated
Anish13/pretrained_fine_tune_gpt2
Text Classification
•
0.1B
•
Updated
•
2