[2025-08-05 12:28:57,402][863087] Saving configuration to /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/config.json...
[2025-08-05 12:28:57,410][863087] Rollout worker 0 uses device cpu
[2025-08-05 12:28:57,410][863087] Rollout worker 1 uses device cpu
[2025-08-05 12:28:57,411][863087] Rollout worker 2 uses device cpu
[2025-08-05 12:28:57,411][863087] Rollout worker 3 uses device cpu
[2025-08-05 12:28:57,411][863087] Rollout worker 4 uses device cpu
[2025-08-05 12:28:57,412][863087] Rollout worker 5 uses device cpu
[2025-08-05 12:28:57,412][863087] Rollout worker 6 uses device cpu
[2025-08-05 12:28:57,413][863087] Rollout worker 7 uses device cpu
[2025-08-05 12:28:57,453][863087] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2025-08-05 12:28:57,454][863087] InferenceWorker_p0-w0: min num requests: 2
[2025-08-05 12:28:57,468][863087] Starting all processes...
[2025-08-05 12:28:57,469][863087] Starting process learner_proc0
[2025-08-05 12:28:57,518][863087] Starting all processes...
[2025-08-05 12:28:57,523][863087] Starting process inference_proc0-0
[2025-08-05 12:28:57,523][863087] Starting process rollout_proc0
[2025-08-05 12:28:57,523][863087] Starting process rollout_proc1
[2025-08-05 12:28:57,523][863087] Starting process rollout_proc2
[2025-08-05 12:28:57,524][863087] Starting process rollout_proc3
[2025-08-05 12:28:57,524][863087] Starting process rollout_proc4
[2025-08-05 12:28:57,524][863087] Starting process rollout_proc5
[2025-08-05 12:28:57,525][863087] Starting process rollout_proc6
[2025-08-05 12:28:57,525][863087] Starting process rollout_proc7
[2025-08-05 12:28:59,390][863230] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2025-08-05 12:28:59,390][863230] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0
[2025-08-05 12:28:59,443][863230] Num visible devices: 1
[2025-08-05 12:28:59,444][863230] Starting seed is not provided
[2025-08-05 12:28:59,445][863230] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2025-08-05 12:28:59,445][863230] Initializing actor-critic model on device cuda:0
[2025-08-05 12:28:59,445][863230] RunningMeanStd input shape: (3, 72, 128)
[2025-08-05 12:28:59,447][863230] RunningMeanStd input shape: (1,)
[2025-08-05 12:28:59,454][863230] ConvEncoder: input_channels=3
[2025-08-05 12:28:59,602][863230] Conv encoder output size: 512
[2025-08-05 12:28:59,605][863230] Policy head output size: 512
[2025-08-05 12:28:59,629][863230] Created Actor Critic model with architecture:
[2025-08-05 12:28:59,630][863230] ActorCriticSharedWeights(
  (obs_normalizer): ObservationNormalizer(
    (running_mean_std): RunningMeanStdDictInPlace(
      (running_mean_std): ModuleDict(
        (obs): RunningMeanStdInPlace()
      )
    )
  )
  (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace)
  (encoder): VizdoomEncoder(
    (basic_encoder): ConvEncoder(
      (enc): RecursiveScriptModule(
        original_name=ConvEncoderImpl
        (conv_head): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Conv2d)
          (1): RecursiveScriptModule(original_name=ELU)
          (2): RecursiveScriptModule(original_name=Conv2d)
          (3): RecursiveScriptModule(original_name=ELU)
          (4): RecursiveScriptModule(original_name=Conv2d)
          (5): RecursiveScriptModule(original_name=ELU)
        )
        (mlp_layers): RecursiveScriptModule(
          original_name=Sequential
          (0): RecursiveScriptModule(original_name=Linear)
          (1): RecursiveScriptModule(original_name=ELU)
        )
      )
    )
  )
  (core): ModelCoreRNN(
    (core): GRU(512, 512)
  )
  (decoder): MlpDecoder(
    (mlp): Identity()
  )
  (critic_linear): Linear(in_features=512, out_features=1, bias=True)
  (action_parameterization): ActionParameterizationDefault(
    (distribution_linear): Linear(in_features=512, out_features=5, bias=True)
  )
)
[2025-08-05 12:28:59,656][863246] Worker 2 uses CPU cores [6, 7, 8]
[2025-08-05 12:28:59,656][863245] Worker 0 uses CPU cores [0, 1, 2]
[2025-08-05 12:28:59,656][863251] Worker 7 uses CPU cores [21, 22, 23]
[2025-08-05 12:28:59,671][863248] Worker 5 uses CPU cores [15, 16, 17]
[2025-08-05 12:28:59,694][863249] Worker 4 uses CPU cores [12, 13, 14]
[2025-08-05 12:28:59,694][863243] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2025-08-05 12:28:59,694][863243] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0
[2025-08-05 12:28:59,723][863243] Num visible devices: 1
[2025-08-05 12:28:59,731][863244] Worker 1 uses CPU cores [3, 4, 5]
[2025-08-05 12:28:59,746][863250] Worker 6 uses CPU cores [18, 19, 20]
[2025-08-05 12:28:59,746][863247] Worker 3 uses CPU cores [9, 10, 11]
[2025-08-05 12:29:00,035][863230] Using optimizer <class 'torch.optim.adam.Adam'>
[2025-08-05 12:29:01,001][863230] No checkpoints found
[2025-08-05 12:29:01,001][863230] Did not load from checkpoint, starting from scratch!
[2025-08-05 12:29:01,001][863230] Initialized policy 0 weights for model version 0
[2025-08-05 12:29:01,005][863230] LearnerWorker_p0 finished initialization!
[2025-08-05 12:29:01,005][863230] Using GPUs [0] for process 0 (actually maps to GPUs [0])
[2025-08-05 12:29:01,171][863243] Unhandled exception CUDA error: invalid resource handle
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
in evt loop inference_proc0-0_evt_loop
[2025-08-05 12:29:05,235][863087] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:20,369][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:20,375][863087] Heartbeat connected on RolloutWorker_w4
[2025-08-05 12:29:20,377][863087] Heartbeat connected on RolloutWorker_w3
[2025-08-05 12:29:20,378][863087] Heartbeat connected on RolloutWorker_w5
[2025-08-05 12:29:20,380][863087] Heartbeat connected on RolloutWorker_w2
[2025-08-05 12:29:20,381][863087] Heartbeat connected on RolloutWorker_w0
[2025-08-05 12:29:20,382][863087] Heartbeat connected on RolloutWorker_w1
[2025-08-05 12:29:20,383][863087] Heartbeat connected on LearnerWorker_p0
[2025-08-05 12:29:20,383][863087] Heartbeat connected on RolloutWorker_w6
[2025-08-05 12:29:20,384][863087] Heartbeat connected on RolloutWorker_w7
[2025-08-05 12:29:20,384][863087] Heartbeat connected on Batcher_0
[2025-08-05 12:29:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:29:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:30:55,239][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:31:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:31:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:32:55,237][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:33:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:33:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:34:55,238][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:35:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:35:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:36:55,241][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:37:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:37:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:38:55,236][863087] Components not started: InferenceWorker_p0-w0, wait_time=600.0 seconds
[2025-08-05 12:38:55,237][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:39:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:39:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:40:55,246][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:41:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:41:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:42:55,243][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:43:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:43:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:44:55,243][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:45:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:40,265][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:45:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:15,342][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:50,424][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:46:55,242][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:47:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:25,517][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:47:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:00,777][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:35,885][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:48:55,236][863087] Components not started: InferenceWorker_p0-w0, wait_time=1200.0 seconds
[2025-08-05 12:48:55,237][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:49:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:11,052][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:46,242][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:49:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:21,369][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:56,556][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:50:56,560][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:51:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:31,743][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:51:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:07,168][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:42,568][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:52:55,244][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:53:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:17,452][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:30,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:40,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:52,941][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:53:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:00,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:10,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:27,470][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:35,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:45,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:50,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:55,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:54:55,242][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:55:03,100][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:15,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:20,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:25,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:55:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:12,789][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:47,434][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:56:55,238][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:57:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:05,235][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:22,969][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:57:58,256][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:00,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:05,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:10,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:15,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:20,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:25,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:30,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:35,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:40,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:45,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:50,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:55,234][863087] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 0.0. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0)
[2025-08-05 12:58:55,237][863087] Components not started: InferenceWorker_p0-w0, wait_time=1800.0 seconds
[2025-08-05 12:58:55,239][863087] Components take too long to start: InferenceWorker_p0-w0. Aborting the experiment!
[2025-08-05 12:58:55,240][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:58:55,246][863087] Component InferenceWorker_p0-w0 process died already! Don't wait for it.
[2025-08-05 12:58:55,248][863230] Stopping Batcher_0...
[2025-08-05 12:58:55,249][863087] Component RolloutWorker_w5 stopped!
[2025-08-05 12:58:55,250][863230] Loop batcher_evt_loop terminating...
[2025-08-05 12:58:55,252][863087] Waiting for ['Batcher_0', 'LearnerWorker_p0', 'RolloutWorker_w0', 'RolloutWorker_w1', 'RolloutWorker_w2', 'RolloutWorker_w3', 'RolloutWorker_w4', 'RolloutWorker_w6', 'RolloutWorker_w7'] to stop...
[2025-08-05 12:58:55,245][863248] Stopping RolloutWorker_w5...
[2025-08-05 12:58:55,246][863246] Stopping RolloutWorker_w2...
[2025-08-05 12:58:55,246][863247] Stopping RolloutWorker_w3...
[2025-08-05 12:58:55,258][863248] Loop rollout_proc5_evt_loop terminating...
[2025-08-05 12:58:55,259][863246] Loop rollout_proc2_evt_loop terminating...
[2025-08-05 12:58:55,259][863247] Loop rollout_proc3_evt_loop terminating...
[2025-08-05 12:58:55,247][863244] Stopping RolloutWorker_w1...
[2025-08-05 12:58:55,260][863244] Loop rollout_proc1_evt_loop terminating...
[2025-08-05 12:58:55,253][863245] Stopping RolloutWorker_w0...
[2025-08-05 12:58:55,248][863250] Stopping RolloutWorker_w6...
[2025-08-05 12:58:55,253][863251] Stopping RolloutWorker_w7...
[2025-08-05 12:58:55,263][863245] Loop rollout_proc0_evt_loop terminating...
[2025-08-05 12:58:55,264][863250] Loop rollout_proc6_evt_loop terminating...
[2025-08-05 12:58:55,264][863087] Component RolloutWorker_w2 stopped!
[2025-08-05 12:58:55,264][863251] Loop rollout_proc7_evt_loop terminating...
[2025-08-05 12:58:55,265][863087] Waiting for ['Batcher_0', 'LearnerWorker_p0', 'RolloutWorker_w0', 'RolloutWorker_w1', 'RolloutWorker_w3', 'RolloutWorker_w4', 'RolloutWorker_w6', 'RolloutWorker_w7'] to stop...
[2025-08-05 12:58:55,265][863087] Component RolloutWorker_w3 stopped!
[2025-08-05 12:58:55,267][863087] Waiting for ['Batcher_0', 'LearnerWorker_p0', 'RolloutWorker_w0', 'RolloutWorker_w1', 'RolloutWorker_w4', 'RolloutWorker_w6', 'RolloutWorker_w7'] to stop...
[2025-08-05 12:58:55,252][863249] Stopping RolloutWorker_w4...
[2025-08-05 12:58:55,268][863087] Component RolloutWorker_w6 stopped!
[2025-08-05 12:58:55,269][863249] Loop rollout_proc4_evt_loop terminating...
[2025-08-05 12:58:55,269][863087] Waiting for ['Batcher_0', 'LearnerWorker_p0', 'RolloutWorker_w0', 'RolloutWorker_w1', 'RolloutWorker_w4', 'RolloutWorker_w7'] to stop...
[2025-08-05 12:58:55,270][863087] Component RolloutWorker_w4 stopped!
[2025-08-05 12:58:55,271][863087] Waiting for ['Batcher_0', 'LearnerWorker_p0', 'RolloutWorker_w0', 'RolloutWorker_w1', 'RolloutWorker_w7'] to stop...
[2025-08-05 12:58:55,271][863087] Component RolloutWorker_w1 stopped!
[2025-08-05 12:58:55,272][863087] Waiting for ['Batcher_0', 'LearnerWorker_p0', 'RolloutWorker_w0', 'RolloutWorker_w7'] to stop...
[2025-08-05 12:58:55,273][863087] Component Batcher_0 stopped!
[2025-08-05 12:58:55,273][863087] Waiting for ['LearnerWorker_p0', 'RolloutWorker_w0', 'RolloutWorker_w7'] to stop...
[2025-08-05 12:58:55,274][863087] Component RolloutWorker_w7 stopped!
[2025-08-05 12:58:55,275][863087] Waiting for ['LearnerWorker_p0', 'RolloutWorker_w0'] to stop...
[2025-08-05 12:58:55,276][863087] Component RolloutWorker_w0 stopped!
[2025-08-05 12:58:55,276][863087] Waiting for ['LearnerWorker_p0'] to stop...
[2025-08-05 12:58:55,316][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:58:55,376][863230] Saving /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 12:58:55,431][863230] Stopping LearnerWorker_p0...
[2025-08-05 12:58:55,431][863230] Loop learner_proc0_evt_loop terminating...
[2025-08-05 12:58:55,431][863087] Component LearnerWorker_p0 stopped!
[2025-08-05 12:58:55,432][863087] Waiting for process learner_proc0 to stop...
[2025-08-05 12:58:57,444][863087] Waiting for process inference_proc0-0 to join...
[2025-08-05 12:58:57,446][863087] Waiting for process rollout_proc0 to join...
[2025-08-05 12:58:57,448][863087] Waiting for process rollout_proc1 to join...
[2025-08-05 12:58:57,449][863087] Waiting for process rollout_proc2 to join...
[2025-08-05 12:58:57,450][863087] Waiting for process rollout_proc3 to join...
[2025-08-05 12:58:57,453][863087] Waiting for process rollout_proc4 to join...
[2025-08-05 12:58:57,455][863087] Waiting for process rollout_proc5 to join...
[2025-08-05 12:58:57,456][863087] Waiting for process rollout_proc6 to join...
[2025-08-05 12:58:57,457][863087] Waiting for process rollout_proc7 to join...
[2025-08-05 12:58:57,459][863087] Batcher 0 profile tree view:
[2025-08-05 12:58:57,460][863087] Learner 0 profile tree view:
[2025-08-05 12:58:57,460][863087] RolloutWorker_w0 profile tree view:
[2025-08-05 12:58:57,461][863087] RolloutWorker_w7 profile tree view:
[2025-08-05 12:58:57,462][863087] Loop Runner_EvtLoop terminating...
[2025-08-05 12:58:57,464][863087] Runner profile tree view:
main_loop: 1799.9959
[2025-08-05 12:58:57,465][863087] Collected {0: 0}, FPS: 0.0
[2025-08-05 14:58:55,019][863087] Loading existing experiment configuration from /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/config.json
[2025-08-05 14:58:55,020][863087] Overriding arg 'num_workers' with value 1 passed from command line
[2025-08-05 14:58:55,021][863087] Adding new argument 'no_render'=True that is not in the saved config file!
[2025-08-05 14:58:55,021][863087] Adding new argument 'save_video'=True that is not in the saved config file!
[2025-08-05 14:58:55,021][863087] Adding new argument 'video_frames'=1000000000.0 that is not in the saved config file!
[2025-08-05 14:58:55,022][863087] Adding new argument 'video_name'=None that is not in the saved config file!
[2025-08-05 14:58:55,022][863087] Adding new argument 'max_num_frames'=1000000000.0 that is not in the saved config file!
[2025-08-05 14:58:55,023][863087] Adding new argument 'max_num_episodes'=10 that is not in the saved config file!
[2025-08-05 14:58:55,023][863087] Adding new argument 'push_to_hub'=False that is not in the saved config file!
[2025-08-05 14:58:55,024][863087] Adding new argument 'hf_repository'=None that is not in the saved config file!
[2025-08-05 14:58:55,024][863087] Adding new argument 'policy_index'=0 that is not in the saved config file!
[2025-08-05 14:58:55,025][863087] Adding new argument 'eval_deterministic'=False that is not in the saved config file!
[2025-08-05 14:58:55,026][863087] Adding new argument 'train_script'=None that is not in the saved config file!
[2025-08-05 14:58:55,026][863087] Adding new argument 'enjoy_script'=None that is not in the saved config file!
[2025-08-05 14:58:55,026][863087] Using frameskip 1 and render_action_repeat=4 for evaluation
[2025-08-05 14:58:55,058][863087] Doom resolution: 160x120, resize resolution: (128, 72)
[2025-08-05 14:58:55,063][863087] RunningMeanStd input shape: (3, 72, 128)
[2025-08-05 14:58:55,071][863087] RunningMeanStd input shape: (1,)
[2025-08-05 14:58:55,095][863087] ConvEncoder: input_channels=3
[2025-08-05 14:58:55,192][863087] Conv encoder output size: 512
[2025-08-05 14:58:55,197][863087] Policy head output size: 512
[2025-08-05 14:58:55,525][863087] Loading state from checkpoint /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 14:58:56,610][863087] Num frames 100...
[2025-08-05 14:58:56,673][863087] Num frames 200...
[2025-08-05 14:58:56,737][863087] Num frames 300...
[2025-08-05 14:58:56,842][863087] Avg episode rewards: #0: 3.840, true rewards: #0: 3.840
[2025-08-05 14:58:56,843][863087] Avg episode reward: 3.840, avg true_objective: 3.840
[2025-08-05 14:58:56,859][863087] Num frames 400...
[2025-08-05 14:58:56,926][863087] Num frames 500...
[2025-08-05 14:58:56,991][863087] Num frames 600...
[2025-08-05 14:58:57,056][863087] Num frames 700...
[2025-08-05 14:58:57,150][863087] Avg episode rewards: #0: 3.840, true rewards: #0: 3.840
[2025-08-05 14:58:57,153][863087] Avg episode reward: 3.840, avg true_objective: 3.840
[2025-08-05 14:58:57,177][863087] Num frames 800...
[2025-08-05 14:58:57,236][863087] Num frames 900...
[2025-08-05 14:58:57,297][863087] Num frames 1000...
[2025-08-05 14:58:57,357][863087] Num frames 1100...
[2025-08-05 14:58:57,441][863087] Avg episode rewards: #0: 3.840, true rewards: #0: 3.840
[2025-08-05 14:58:57,444][863087] Avg episode reward: 3.840, avg true_objective: 3.840
[2025-08-05 14:58:57,480][863087] Num frames 1200...
[2025-08-05 14:58:57,539][863087] Num frames 1300...
[2025-08-05 14:58:57,599][863087] Num frames 1400...
[2025-08-05 14:58:57,668][863087] Num frames 1500...
[2025-08-05 14:58:57,743][863087] Avg episode rewards: #0: 3.840, true rewards: #0: 3.840
[2025-08-05 14:58:57,745][863087] Avg episode reward: 3.840, avg true_objective: 3.840
[2025-08-05 14:58:57,789][863087] Num frames 1600...
[2025-08-05 14:58:57,851][863087] Num frames 1700...
[2025-08-05 14:58:57,909][863087] Num frames 1800...
[2025-08-05 14:58:57,969][863087] Num frames 1900...
[2025-08-05 14:58:58,054][863087] Avg episode rewards: #0: 3.904, true rewards: #0: 3.904
[2025-08-05 14:58:58,057][863087] Avg episode reward: 3.904, avg true_objective: 3.904
[2025-08-05 14:58:58,091][863087] Num frames 2000...
[2025-08-05 14:58:58,151][863087] Num frames 2100...
[2025-08-05 14:58:58,214][863087] Num frames 2200...
[2025-08-05 14:58:58,279][863087] Num frames 2300...
[2025-08-05 14:58:58,344][863087] Num frames 2400...
[2025-08-05 14:58:58,456][863087] Avg episode rewards: #0: 4.493, true rewards: #0: 4.160
[2025-08-05 14:58:58,458][863087] Avg episode reward: 4.493, avg true_objective: 4.160
[2025-08-05 14:58:58,467][863087] Num frames 2500...
[2025-08-05 14:58:58,528][863087] Num frames 2600...
[2025-08-05 14:58:58,590][863087] Num frames 2700...
[2025-08-05 14:58:58,650][863087] Num frames 2800...
[2025-08-05 14:58:58,711][863087] Num frames 2900...
[2025-08-05 14:58:58,791][863087] Avg episode rewards: #0: 4.634, true rewards: #0: 4.206
[2025-08-05 14:58:58,793][863087] Avg episode reward: 4.634, avg true_objective: 4.206
[2025-08-05 14:58:58,831][863087] Num frames 3000...
[2025-08-05 14:58:58,893][863087] Num frames 3100...
[2025-08-05 14:58:58,954][863087] Num frames 3200...
[2025-08-05 14:58:59,015][863087] Num frames 3300...
[2025-08-05 14:58:59,128][863087] Avg episode rewards: #0: 4.740, true rewards: #0: 4.240
[2025-08-05 14:58:59,131][863087] Avg episode reward: 4.740, avg true_objective: 4.240
[2025-08-05 14:58:59,140][863087] Num frames 3400...
[2025-08-05 14:58:59,200][863087] Num frames 3500...
[2025-08-05 14:58:59,259][863087] Num frames 3600...
[2025-08-05 14:58:59,319][863087] Num frames 3700...
[2025-08-05 14:58:59,416][863087] Avg episode rewards: #0: 4.640, true rewards: #0: 4.196
[2025-08-05 14:58:59,419][863087] Avg episode reward: 4.640, avg true_objective: 4.196
[2025-08-05 14:58:59,438][863087] Num frames 3800...
[2025-08-05 14:58:59,499][863087] Num frames 3900...
[2025-08-05 14:58:59,560][863087] Num frames 4000...
[2025-08-05 14:58:59,621][863087] Num frames 4100...
[2025-08-05 14:58:59,682][863087] Num frames 4200...
[2025-08-05 14:58:59,742][863087] Num frames 4300...
[2025-08-05 14:58:59,808][863087] Avg episode rewards: #0: 4.920, true rewards: #0: 4.320
[2025-08-05 14:58:59,811][863087] Avg episode reward: 4.920, avg true_objective: 4.320
[2025-08-05 14:59:03,851][863087] Replay video saved to /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/replay.mp4!
[2025-08-05 15:01:11,317][863087] Loading existing experiment configuration from /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/config.json
[2025-08-05 15:01:11,318][863087] Overriding arg 'num_workers' with value 1 passed from command line
[2025-08-05 15:01:11,319][863087] Adding new argument 'no_render'=True that is not in the saved config file!
[2025-08-05 15:01:11,319][863087] Adding new argument 'save_video'=True that is not in the saved config file!
[2025-08-05 15:01:11,320][863087] Adding new argument 'video_frames'=1000000000.0 that is not in the saved config file!
[2025-08-05 15:01:11,320][863087] Adding new argument 'video_name'=None that is not in the saved config file!
[2025-08-05 15:01:11,321][863087] Adding new argument 'max_num_frames'=100000 that is not in the saved config file!
[2025-08-05 15:01:11,321][863087] Adding new argument 'max_num_episodes'=10 that is not in the saved config file!
[2025-08-05 15:01:11,322][863087] Adding new argument 'push_to_hub'=True that is not in the saved config file!
[2025-08-05 15:01:11,322][863087] Adding new argument 'hf_repository'='iqbal1282/rl_course_vizdoom_health_gathering_supreme' that is not in the saved config file!
[2025-08-05 15:01:11,323][863087] Adding new argument 'policy_index'=0 that is not in the saved config file!
[2025-08-05 15:01:11,323][863087] Adding new argument 'eval_deterministic'=False that is not in the saved config file!
[2025-08-05 15:01:11,324][863087] Adding new argument 'train_script'=None that is not in the saved config file!
[2025-08-05 15:01:11,324][863087] Adding new argument 'enjoy_script'=None that is not in the saved config file!
[2025-08-05 15:01:11,325][863087] Using frameskip 1 and render_action_repeat=4 for evaluation
[2025-08-05 15:01:11,337][863087] RunningMeanStd input shape: (3, 72, 128)
[2025-08-05 15:01:11,338][863087] RunningMeanStd input shape: (1,)
[2025-08-05 15:01:11,344][863087] ConvEncoder: input_channels=3
[2025-08-05 15:01:11,363][863087] Conv encoder output size: 512
[2025-08-05 15:01:11,366][863087] Policy head output size: 512
[2025-08-05 15:01:11,386][863087] Loading state from checkpoint /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/checkpoint_p0/checkpoint_000000000_0.pth...
[2025-08-05 15:01:11,818][863087] Num frames 100...
[2025-08-05 15:01:11,921][863087] Num frames 200...
[2025-08-05 15:01:12,024][863087] Num frames 300...
[2025-08-05 15:01:12,128][863087] Num frames 400...
[2025-08-05 15:01:12,227][863087] Avg episode rewards: #0: 5.480, true rewards: #0: 4.480
[2025-08-05 15:01:12,230][863087] Avg episode reward: 5.480, avg true_objective: 4.480
[2025-08-05 15:01:12,278][863087] Num frames 500...
[2025-08-05 15:01:12,365][863087] Num frames 600...
[2025-08-05 15:01:12,445][863087] Num frames 700...
[2025-08-05 15:01:12,531][863087] Num frames 800...
[2025-08-05 15:01:12,582][863087] Avg episode rewards: #0: 5.000, true rewards: #0: 4.000
[2025-08-05 15:01:12,583][863087] Avg episode reward: 5.000, avg true_objective: 4.000
[2025-08-05 15:01:12,669][863087] Num frames 900...
[2025-08-05 15:01:12,743][863087] Num frames 1000...
[2025-08-05 15:01:12,834][863087] Num frames 1100...
[2025-08-05 15:01:12,917][863087] Num frames 1200...
[2025-08-05 15:01:13,009][863087] Avg episode rewards: #0: 5.160, true rewards: #0: 4.160
[2025-08-05 15:01:13,010][863087] Avg episode reward: 5.160, avg true_objective: 4.160
[2025-08-05 15:01:13,053][863087] Num frames 1300...
[2025-08-05 15:01:13,142][863087] Num frames 1400...
[2025-08-05 15:01:13,231][863087] Num frames 1500...
[2025-08-05 15:01:13,309][863087] Num frames 1600...
[2025-08-05 15:01:13,385][863087] Avg episode rewards: #0: 4.830, true rewards: #0: 4.080
[2025-08-05 15:01:13,388][863087] Avg episode reward: 4.830, avg true_objective: 4.080
[2025-08-05 15:01:13,458][863087] Num frames 1700...
[2025-08-05 15:01:13,536][863087] Num frames 1800...
[2025-08-05 15:01:13,605][863087] Num frames 1900...
[2025-08-05 15:01:13,683][863087] Num frames 2000...
[2025-08-05 15:01:13,785][863087] Avg episode rewards: #0: 4.960, true rewards: #0: 4.160
[2025-08-05 15:01:13,787][863087] Avg episode reward: 4.960, avg true_objective: 4.160
[2025-08-05 15:01:13,805][863087] Num frames 2100...
[2025-08-05 15:01:13,876][863087] Num frames 2200...
[2025-08-05 15:01:13,953][863087] Num frames 2300...
[2025-08-05 15:01:14,021][863087] Num frames 2400...
[2025-08-05 15:01:14,073][863087] Avg episode rewards: #0: 4.833, true rewards: #0: 4.000
[2025-08-05 15:01:14,075][863087] Avg episode reward: 4.833, avg true_objective: 4.000
[2025-08-05 15:01:14,142][863087] Num frames 2500...
[2025-08-05 15:01:14,203][863087] Num frames 2600...
[2025-08-05 15:01:14,263][863087] Num frames 2700...
[2025-08-05 15:01:14,366][863087] Avg episode rewards: #0: 4.691, true rewards: #0: 3.977
[2025-08-05 15:01:14,368][863087] Avg episode reward: 4.691, avg true_objective: 3.977
[2025-08-05 15:01:14,382][863087] Num frames 2800...
[2025-08-05 15:01:14,454][863087] Num frames 2900...
[2025-08-05 15:01:14,523][863087] Num frames 3000...
[2025-08-05 15:01:14,588][863087] Num frames 3100...
[2025-08-05 15:01:14,654][863087] Num frames 3200...
[2025-08-05 15:01:14,722][863087] Num frames 3300...
[2025-08-05 15:01:14,793][863087] Avg episode rewards: #0: 5.035, true rewards: #0: 4.160
[2025-08-05 15:01:14,796][863087] Avg episode reward: 5.035, avg true_objective: 4.160
[2025-08-05 15:01:14,849][863087] Num frames 3400...
[2025-08-05 15:01:14,927][863087] Num frames 3500...
[2025-08-05 15:01:15,005][863087] Num frames 3600...
[2025-08-05 15:01:15,090][863087] Num frames 3700...
[2025-08-05 15:01:15,151][863087] Avg episode rewards: #0: 4.902, true rewards: #0: 4.124
[2025-08-05 15:01:15,153][863087] Avg episode reward: 4.902, avg true_objective: 4.124
[2025-08-05 15:01:15,231][863087] Num frames 3800...
[2025-08-05 15:01:15,310][863087] Num frames 3900...
[2025-08-05 15:01:15,384][863087] Num frames 4000...
[2025-08-05 15:01:15,470][863087] Num frames 4100...
[2025-08-05 15:01:15,546][863087] Num frames 4200...
[2025-08-05 15:01:15,626][863087] Num frames 4300...
[2025-08-05 15:01:15,708][863087] Num frames 4400...
[2025-08-05 15:01:15,802][863087] Avg episode rewards: #0: 5.648, true rewards: #0: 4.448
[2025-08-05 15:01:15,803][863087] Avg episode reward: 5.648, avg true_objective: 4.448
[2025-08-05 15:01:19,837][863087] Replay video saved to /mnt/c/Users/mdiqb/Desktop/reinforce_learning/reinforcement_learning_huggingface/train_dir/default_experiment/replay.mp4!