Upload folder using huggingface_hub

Files changed (5)

README.md CHANGED

@@ -1,29 +1,29 @@
 ---
 tags:
-- PongNoFrameskip-v4
 - ppo
 - reinforcement-learning
 - stable-baselines3
 - deep-rl
 - atari
 model-index:
-  - name: PPO Pong Atari
     results:
       - task:
           type: reinforcement-learning
-          name: Pong
         dataset:
-          name: PongNoFrameskip-v4
           type: atari
         metrics:
           - name: Mean Reward
             type: mean_reward
-            value: 21.00 +/- 0.00
 ---
-# **PPO** Agent playing **PongNoFrameskip-v4**
-This is a trained model of a **PPO** agent playing **PongNoFrameskip-v4**.
 To learn to use this model and train yours, check the Deep Reinforcement Learning Course on [Hugging Face](https://huggingface.co/deep-rl-course).

 ---
 tags:
+- BreakoutNoFrameskip-v4
 - ppo
 - reinforcement-learning
 - stable-baselines3
 - deep-rl
 - atari
 model-index:
+  - name: PPO Breakout
     results:
       - task:
           type: reinforcement-learning
+          name: Breakout
         dataset:
+          name: BreakoutNoFrameskip-v4
           type: atari
         metrics:
           - name: Mean Reward
             type: mean_reward
+            value: 43.50 +/- 22.70
 ---
+# **PPO** Agent playing **BreakoutNoFrameskip-v4**
+This is a trained model of a **PPO** agent playing **BreakoutNoFrameskip-v4**.
 To learn to use this model and train yours, check the Deep Reinforcement Learning Course on [Hugging Face](https://huggingface.co/deep-rl-course).

hyperparameters.json CHANGED

@@ -1 +1 @@
-{"env_id": "PongNoFrameskip-v4", "max_t": 1000, "n_evaluation_episodes": 10}
+{"env_id": "BreakoutNoFrameskip-v4", "max_t": 1000, "n_evaluation_episodes": 10}

ppo_breakout.zip ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:017d3fd0bc4ff498bf573ec11475100cf0d20c27137b87e381957f4d959ed5d5
+size 25496839
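
The three added lines above are a Git LFS pointer, not the model archive itself: they record the spec version, a SHA-256 object id, and the byte size of the real file. As a minimal sketch (the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and it assumes the archive has already been downloaded to a local path), a downloaded `ppo_breakout.zip` could be checked against the pointer like this:

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and digest the pointer records."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so a large archive is not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the diff above, without the leading "+" markers.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:017d3fd0bc4ff498bf573ec11475100cf0d20c27137b87e381957f4d959ed5d5
size 25496839
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])

# Assumes a local copy of the archive exists at this path:
# matches_pointer("ppo_breakout.zip", pointer)
```

The size is compared first because it is cheap and rules out truncated downloads before any hashing is done.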

replay.mp4 CHANGED

Binary files a/replay.mp4 and b/replay.mp4 differ

results.json CHANGED

@@ -1 +1 @@
-{"env_id": "PongNoFrameskip-v4", "mean_reward": 21.0, "n_evaluation_episodes": 10, "eval_datetime": "2024-10-20T14:03:35.744333"}
+{"env_id": "BreakoutNoFrameskip-v4", "mean_reward": 43.5, "n_evaluation_episodes": 10, "eval_datetime": "2024-10-20T19:57:57.978604"}
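
Because each version of results.json is a single JSON object, the change is easier to read per key than per line. The following is a rough sketch only; `changed_keys` is a hypothetical helper, and the two records are copied from the removed and added lines of the results.json diff above:

```python
import json

# Removed and added records from the results.json diff.
OLD = ('{"env_id": "PongNoFrameskip-v4", "mean_reward": 21.0, '
       '"n_evaluation_episodes": 10, "eval_datetime": "2024-10-20T14:03:35.744333"}')
NEW = ('{"env_id": "BreakoutNoFrameskip-v4", "mean_reward": 43.5, '
       '"n_evaluation_episodes": 10, "eval_datetime": "2024-10-20T19:57:57.978604"}')


def changed_keys(old, new):
    """Map each key whose value differs to an (old, new) pair (hypothetical helper)."""
    keys = sorted(set(old) | set(new))
    return {k: (old.get(k), new.get(k)) for k in keys if old.get(k) != new.get(k)}


changes = changed_keys(json.loads(OLD), json.loads(NEW))
for key, (before, after) in changes.items():
    print(f"{key}: {before!r} -> {after!r}")
```

Only `env_id`, `mean_reward`, and `eval_datetime` differ; `n_evaluation_episodes` is 10 in both records.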