AI & ML interests
None yet
Organizations
None yet
models
18
ALEXIOSTER/Humorous_SFT_LLama2_7b
Updated
ALEXIOSTER/Humorous_DPO_LLama2_7b
Updated
ALEXIOSTER/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
ALEXIOSTER/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
ALEXIOSTER/ppo-CartPole-v1
Reinforcement Learning
•
Updated
ALEXIOSTER/poca-SoccerTwos
Reinforcement Learning
•
Updated
ALEXIOSTER/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
•
4
ALEXIOSTER/ppo-SnowballTarget
Reinforcement Learning
•
Updated