Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

GuanOrg
/
DeepRLCourse2022

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card Files Files and versions Community
DeepRLCourse2022
Ctrl+K
Ctrl+K
  • 1 contributor
History: 4 commits
bguan's picture
bguan
bguan's lunar lander model #3 using PPO trained for 1M timesteps
ee17131 about 3 years ago
  • bguan_ppo_lunarlander
    bguan's lunar lander model using PPO trained for 500K timesteps about 3 years ago
  • bguan_ppo_lunarlander2
    bguan's lunar lander model #2 using PPO trained for 500K timesteps about 3 years ago
  • bguan_ppo_lunarlander3
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 3 years ago
  • .gitattributes
    1.22 kB
    bguan's lunar lander model using PPO trained for 500K timesteps about 3 years ago
  • README.md
    677 Bytes
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 3 years ago
  • bguan_ppo_lunarlander.zip
    144 kB
    LFS
    bguan's lunar lander model using PPO trained for 500K timesteps about 3 years ago
  • bguan_ppo_lunarlander2.zip
    144 kB
    LFS
    bguan's lunar lander model #2 using PPO trained for 500K timesteps about 3 years ago
  • bguan_ppo_lunarlander3.zip
    144 kB
    LFS
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 3 years ago
  • config.json
    14.4 kB
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 3 years ago
  • replay.mp4
    245 kB
    LFS
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 3 years ago
  • results.json
    165 Bytes
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 3 years ago