AI & ML interests
None yet
Organizations
None yet
ALEXIOSTER/Humorous_SFT_LLama2_7b
Updated
ALEXIOSTER/Humorous_DPO_LLama2_7b
Updated
ALEXIOSTER/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
ALEXIOSTER/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
ALEXIOSTER/ppo-CartPole-v1
Reinforcement Learning
•
Updated
ALEXIOSTER/poca-SoccerTwos
Reinforcement Learning
•
Updated
ALEXIOSTER/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
•
1
ALEXIOSTER/ppo-SnowballTarget
Reinforcement Learning
•
Updated
ALEXIOSTER/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
ALEXIOSTER/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
ALEXIOSTER/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
ALEXIOSTER/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
ALEXIOSTER/LunarLander-v2
Reinforcement Learning
•
Updated
ALEXIOSTER/sft_openassistant-guanaco
Updated
ALEXIOSTER/gpt2-imdb-pos-v2
Text Generation
•
0.1B
•
Updated
•
4