Francesco-A/a2c-PandaReachDense-v3

Tags: Reinforcement Learning, stable-baselines3, PandaReachDense-v3, deep-reinforcement-learning


Francesco-A committed on Aug 19, 2023

Commit 61343e6 · 1 Parent(s): 4551cb9

Update README.md

Files changed (1)

README.md +23 -4


@@ -26,12 +26,31 @@ This is a trained model of a **A2C** agent playing **PandaReachDense-v3**
 using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 ## Usage (with Stable-baselines3)
-TODO: Add your code
 ```python
-from stable_baselines3 import ...
 from huggingface_sb3 import load_from_hub
-...
 ```

 using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 ## Usage (with Stable-baselines3)
 ```python
+from stable_baselines3 import A2C
 from huggingface_sb3 import load_from_hub
+model = load_from_hub(repo_id='Francesco-A/a2c-PandaReachDense-v3',
+                     filename= 'a2c-PandaReachDense-v3.zip')
 ```
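Note that `load_from_hub` from `huggingface_sb3` downloads the file and returns a local path to it, not a model object, so the checkpoint still has to be passed to `A2C.load` before it can be used. The following is a minimal sketch of that two-step pattern, assuming `stable-baselines3`, `huggingface_sb3`, `gymnasium` and `panda_gym` are installed. The helper names `load_agent` and `run_steps` are illustrative and not part of the README. Whether the agent was trained with observation normalization (for example `VecNormalize`) is not stated here; if it was, the saved statistics would also need to be restored for the policy to behave as trained.

```python
REPO_ID = "Francesco-A/a2c-PandaReachDense-v3"
FILENAME = "a2c-PandaReachDense-v3.zip"


def load_agent(repo_id: str = REPO_ID, filename: str = FILENAME):
    """Download the checkpoint from the Hub and load it as an A2C model."""
    # Imported lazily so this module can be imported without the RL stack.
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import A2C

    # load_from_hub returns the local path of the downloaded .zip file.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    # A2C.load turns that checkpoint into a usable model.
    return A2C.load(checkpoint_path)


def run_steps(model, n_steps: int = 50, env_id: str = "PandaReachDense-v3"):
    """Roll the policy out for n_steps environment steps (hypothetical helper)."""
    import gymnasium as gym
    import panda_gym  # noqa: F401  (importing registers the Panda environments)

    env = gym.make(env_id)
    obs, _ = env.reset()
    for _ in range(n_steps):
        action, _ = model.predict(obs, deterministic=True)
        obs, reward, terminated, truncated, _ = env.step(action)
        if terminated or truncated:
            obs, _ = env.reset()
    env.close()
```

Calling `run_steps(load_agent())` would download the checkpoint and step the agent in the environment; nothing is downloaded until `load_agent` is actually called.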
+## Training details (last output)
+------------------------------------
+| rollout/              |          |
+|    ep_len_mean        | 4.05     |
+|    ep_rew_mean        | -0.317   |
+| time/                 |          |
+|    fps                | 378      |
+|    iterations         | 50000    |
+|    time_elapsed       | 2641     |
+|    total_timesteps    | 1000000  |
+| train/                |          |
+|    entropy_loss       | 1.25     |
+|    explained_variance | 0.975    |
+|    learning_rate      | 0.0007   |
+|    n_updates          | 49999    |
+|    policy_loss        | -0.0935  |
+|    std                | 0.185    |
+|    value_loss         | 0.0306   |
+------------------------------------
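
The counters in this log can be cross-checked against each other. The sketch below uses only values from the table. Reading the resulting 20 steps per iteration as A2C's default `n_steps=5` with 4 parallel environments is an assumption, since the training script is not shown.

```python
# Values copied from the training log above.
total_timesteps = 1_000_000
iterations = 50_000
fps = 378
time_elapsed = 2641  # seconds

# Environment steps collected per iteration (n_steps * n_envs in SB3 terms).
steps_per_iteration = total_timesteps // iterations
print(steps_per_iteration)  # 20

# fps is a rounded average, so fps * time_elapsed only approximates the total.
approx_timesteps = fps * time_elapsed
print(approx_timesteps)  # 998298

relative_gap = abs(total_timesteps - approx_timesteps) / total_timesteps
print(f"{relative_gap:.2%}")  # 0.17%
```

The small gap between `fps * time_elapsed` and `total_timesteps` is what one would expect from rounding in the logged `fps` and `time_elapsed` values.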