maxstahl committed
Commit dc518ae
1 Parent(s): 6885c4c

A2C Training 3h

README.md CHANGED
@@ -16,7 +16,7 @@ model-index:
       type: PongNoFrameskip-v4
     metrics:
     - type: mean_reward
-      value: 21.00 +/- 0.00
+      value: 19.60 +/- 0.80
       name: mean_reward
       verified: false
 ---
a2c_6h.zip ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b94bf42f575a5bd8fb371a051a927f46fd403cf41b87bd43592bf40f3131daf3
+size 13593773
a2c_6h/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
+2.0.0a5
a2c_6h/data ADDED
The diff for this file is too large to render. See raw diff
 
a2c_6h/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e72be3f0e5c470df35fd90d92db34d43daddd89ca1f97a89d6f990481e2b5c14
+size 6733134
a2c_6h/policy.pth ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:99c3cf74b6a5840405db23046e5fed0fe7b4b045c82897fbcc2a6ef9ffefcaa4
+size 6733298
a2c_6h/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ebdad4b9cfe9cd22a3abadb5623bf7bb1f6eb2e408740245eb3f2044b0adc018
+size 864
a2c_6h/system_info.txt ADDED
@@ -0,0 +1,9 @@
+- OS: macOS-14.7-arm64-arm-64bit Darwin Kernel Version 23.6.0: Wed Jul 31 20:48:52 PDT 2024; root:xnu-10063.141.1.700.5~1/RELEASE_ARM64_T6020
+- Python: 3.10.15
+- Stable-Baselines3: 2.0.0a5
+- PyTorch: 2.4.1
+- GPU Enabled: False
+- Numpy: 1.26.4
+- Cloudpickle: 2.2.1
+- Gymnasium: 0.28.1
+- OpenAI Gym: 0.25.2
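system_info.txt uses the same layout that Stable-Baselines3's get_system_info() helper prints; a sketch of regenerating such a file (whether the packaging script used exactly this call is an assumption):

```python
from stable_baselines3.common.utils import get_system_info

# Returns a dict of fields plus the formatted "- Key: value" report shown above.
env_info, env_info_str = get_system_info(print_info=False)

with open("system_info.txt", "w") as f:
    f.write(env_info_str)
```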
config.json CHANGED
The diff for this file is too large to render. See raw diff
 
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1 +1 @@
1
- {"mean_reward": 21.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2024-10-12T09:57:55.671113"}
 
1
+ {"mean_reward": 19.6, "std_reward": 0.8, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2024-10-15T07:24:31.313852"}