Initial commit

Files changed (10)

README.md ADDED

+---
+library_name: stable-baselines3
+tags:
+- CarRacing-v0
+- deep-reinforcement-learning
+- reinforcement-learning
+- stable-baselines3
+model-index:
+- name: PPO
+  results:
+  - metrics:
+    - type: mean_reward
+      value: 65.27 +/- 147.53
+      name: mean_reward
+    task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: CarRacing-v0
+      type: CarRacing-v0
+---
+# **PPO** Agent playing **CarRacing-v0**
+This is a trained model of a **PPO** agent playing **CarRacing-v0**
+using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+## Usage (with Stable-baselines3)
+TODO: Add your code
+```python
+from stable_baselines3 import ...
+from huggingface_sb3 import load_from_hub
+...
+```
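
The usage section committed above is still a template placeholder. A minimal sketch of what it could contain, assuming the checkpoint is fetched with `load_from_hub` from `huggingface_sb3` and restored with `PPO.load`: the repository id below is a hypothetical placeholder because the diff does not state it, and the helper name `load_agent` is introduced only for illustration. The imports are deferred into the function so the snippet can be read and imported without the packages installed.

```python
# Hypothetical placeholder: the commit does not state the Hub repository id.
REPO_ID = "<user>/<repo>"
# File name taken from the files added in this commit.
FILENAME = "ppo-CarRacing-v0.zip"


def load_agent(repo_id=REPO_ID, filename=FILENAME):
    """Download the checkpoint from the Hub and return the loaded PPO model."""
    # Deferred imports: only needed when the model is actually loaded.
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import PPO

    # load_from_hub returns the local path of the downloaded file.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    return PPO.load(checkpoint_path)
```

Calling `load_agent("your-user/your-repo")` returns a model whose `predict(obs, deterministic=True)` matches the deterministic evaluation recorded in `results.json`. The checkpoint was saved with Stable-Baselines3 1.6.0, Gym 0.21.0, and Python 3.7.13, so a matching environment is the safest way to load it.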

config.json ADDED

The diff for this file is too large to render. See raw diff

ppo-CarRacing-v0.zip ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:04feef31d9bd05a7f4240d228105847e7fd4d8a91096fb08d3ed0ee231f9a837
+size 26879741
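
The three lines above are a Git LFS pointer, not the archive itself: the repository stores the spec version, a SHA-256 object id, and the byte size, while the real file lives in LFS storage. A minimal sketch of reading such a pointer and checking a downloaded file against it, using only the standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are illustrative and not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    hasher = hashlib.new(pointer["hash_algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large checkpoints are not loaded into memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the diff above (without the leading '+').
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:04feef31d9bd05a7f4240d228105847e7fd4d8a91096fb08d3ed0ee231f9a837
size 26879741
"""

pointer = parse_lfs_pointer(POINTER)
```

After downloading the archive, `matches_pointer("ppo-CarRacing-v0.zip", pointer)` would confirm that the local copy is the object this commit refers to.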

ppo-CarRacing-v0/_stable_baselines3_version ADDED

@@ -0,0 +1 @@
+1.6.0

ppo-CarRacing-v0/data ADDED

The diff for this file is too large to render. See raw diff

ppo-CarRacing-v0/policy.optimizer.pth ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:c7a0e9aa545982a1ff61056c5ea88f941a48be309c22de7ff253527fa0ee945f
+size 17415536

ppo-CarRacing-v0/policy.pth ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:9405b854d43b13f6a7b5f1a29aee04f90cd8d8c5734f1987a6346c5e62e34886
+size 8707070

ppo-CarRacing-v0/pytorch_variables.pth ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
+size 431

ppo-CarRacing-v0/system_info.txt ADDED

+OS: Linux-5.4.0-113-generic-x86_64-with-debian-buster-sid #127-Ubuntu SMP Wed May 18 14:30:56 UTC 2022
+Python: 3.7.13
+Stable-Baselines3: 1.6.0
+PyTorch: 1.12.0
+GPU Enabled: True
+Numpy: 1.21.5
+Gym: 0.21.0

results.json ADDED

@@ -0,0 +1 @@
+{"mean_reward": 65.26687043309212, "std_reward": 147.53181521026872, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-07-22T19:27:48.314989"}
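
The `value: 65.27 +/- 147.53` entry in the README metadata appears to be these two numbers rounded to two decimals. A minimal sketch that reproduces the string from the JSON above; treating the model card value as a plain two-decimal format of `mean_reward` and `std_reward` is an assumption about how the card was generated.

```python
import json

# Contents of results.json as committed above.
RESULTS_JSON = (
    '{"mean_reward": 65.26687043309212, "std_reward": 147.53181521026872, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2022-07-22T19:27:48.314989"}'
)

results = json.loads(RESULTS_JSON)

# Format as "mean +/- std" with two decimals, matching the model card entry.
summary = f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"
print(summary)  # prints: 65.27 +/- 147.53
```

The standard deviation is more than twice the mean over only 10 evaluation episodes, so the reported score varies widely from episode to episode.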