AI & ML interests
None yet
Organizations
None yet
yangwj2011/NeuralHermes-2.5-Mistral-7B
Text Generation
•
7B
•
Updated
•
12
yangwj2011/poca_SoccerTwos
Reinforcement Learning
•
Updated
•
25
yangwj2011/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
yangwj2011/ppo-LunarLander-v2-unit8
Reinforcement Learning
•
Updated
yangwj2011/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
•
2
yangwj2011/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
•
3
yangwj2011/ppo-Pyramids-Training
Reinforcement Learning
•
Updated
•
16
yangwj2011/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
29
yangwj2011/Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
yangwj2011/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
5
yangwj2011/Taxi_v3_qlearning
Reinforcement Learning
•
Updated
yangwj2011/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
yangwj2011/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
2