Training 1/1 epoch (loss 3.1153): 0%| | 0/625 [00:07<?, ?it/s]
Training 1/1 epoch (loss 3.1153): 0%| | 1/625 [00:07<1:20:48, 7.77s/it]
Training 1/1 epoch (loss 3.3717): 0%| | 1/625 [00:11<1:20:48, 7.77s/it]
Training 1/1 epoch (loss 3.3717): 0%| | 2/625 [00:11<53:36, 5.16s/it]
Training 1/1 epoch (loss 3.1839): 0%| | 2/625 [00:12<53:36, 5.16s/it]
Training 1/1 epoch (loss 3.1839): 0%| | 3/625 [00:12<33:40, 3.25s/it]
Training 1/1 epoch (loss 3.0328): 0%| | 3/625 [00:13<33:40, 3.25s/it]
Training 1/1 epoch (loss 3.0328): 1%| | 4/625 [00:13<24:51, 2.40s/it]
Training 1/1 epoch (loss 3.3355): 1%| | 4/625 [00:13<24:51, 2.40s/it]
Training 1/1 epoch (loss 3.3355): 1%| | 5/625 [00:13<17:47, 1.72s/it]
Training 1/1 epoch (loss 3.2150): 1%| | 5/625 [00:14<17:47, 1.72s/it]
Training 1/1 epoch (loss 3.2150): 1%| | 6/625 [00:14<14:57, 1.45s/it]
Training 1/1 epoch (loss 3.2929): 1%| | 6/625 [00:15<14:57, 1.45s/it]
Training 1/1 epoch (loss 3.2929): 1%| | 7/625 [00:15<13:22, 1.30s/it]
Training 1/1 epoch (loss 3.5700): 1%| | 7/625 [00:16<13:22, 1.30s/it]
Training 1/1 epoch (loss 3.5700): 1%|β | 8/625 [00:16<11:45, 1.14s/it]
Training 1/1 epoch (loss 3.1841): 1%|β | 8/625 [00:17<11:45, 1.14s/it]
Training 1/1 epoch (loss 3.1841): 1%|β | 9/625 [00:17<11:34, 1.13s/it]
Training 1/1 epoch (loss 3.5054): 1%|β | 9/625 [00:18<11:34, 1.13s/it]
Training 1/1 epoch (loss 3.5054): 2%|β | 10/625 [00:18<10:44, 1.05s/it]
Training 1/1 epoch (loss 3.0610): 2%|β | 10/625 [00:19<10:44, 1.05s/it]
Training 1/1 epoch (loss 3.0610): 2%|β | 11/625 [00:19<10:03, 1.02it/s]
Training 1/1 epoch (loss 3.1592): 2%|β | 11/625 [00:19<10:03, 1.02it/s]
Training 1/1 epoch (loss 3.1592): 2%|β | 12/625 [00:19<08:54, 1.15it/s]
Training 1/1 epoch (loss 3.3087): 2%|β | 12/625 [00:20<08:54, 1.15it/s]
Training 1/1 epoch (loss 3.3087): 2%|β | 13/625 [00:20<09:25, 1.08it/s]
Training 1/1 epoch (loss 3.6334): 2%|β | 13/625 [00:21<09:25, 1.08it/s]
Training 1/1 epoch (loss 3.6334): 2%|β | 14/625 [00:21<09:08, 1.11it/s]
Training 1/1 epoch (loss 3.1963): 2%|β | 14/625 [00:22<09:08, 1.11it/s]
Training 1/1 epoch (loss 3.1963): 2%|β | 15/625 [00:22<08:48, 1.15it/s]
Training 1/1 epoch (loss 3.5141): 2%|β | 15/625 [00:23<08:48, 1.15it/s]
Training 1/1 epoch (loss 3.5141): 3%|β | 16/625 [00:23<10:10, 1.00s/it]
Training 1/1 epoch (loss 3.3295): 3%|β | 16/625 [00:24<10:10, 1.00s/it]
Training 1/1 epoch (loss 3.3295): 3%|β | 17/625 [00:24<09:42, 1.04it/s]
Training 1/1 epoch (loss 3.2582): 3%|β | 17/625 [00:25<09:42, 1.04it/s]
Training 1/1 epoch (loss 3.2582): 3%|β | 18/625 [00:25<08:17, 1.22it/s]
Training 1/1 epoch (loss 3.4199): 3%|β | 18/625 [00:25<08:17, 1.22it/s]
Training 1/1 epoch (loss 3.4199): 3%|β | 19/625 [00:25<07:55, 1.28it/s]
Training 1/1 epoch (loss 3.3523): 3%|β | 19/625 [00:27<07:55, 1.28it/s]
Training 1/1 epoch (loss 3.3523): 3%|β | 20/625 [00:27<09:08, 1.10it/s]
Training 1/1 epoch (loss 3.0851): 3%|β | 20/625 [00:28<09:08, 1.10it/s]
Training 1/1 epoch (loss 3.0851): 3%|β | 21/625 [00:28<09:13, 1.09it/s]
Training 1/1 epoch (loss 3.3383): 3%|β | 21/625 [00:28<09:13, 1.09it/s]
Training 1/1 epoch (loss 3.3383): 4%|β | 22/625 [00:28<07:46, 1.29it/s]
Training 1/1 epoch (loss 3.2669): 4%|β | 22/625 [00:29<07:46, 1.29it/s]
Training 1/1 epoch (loss 3.2669): 4%|β | 23/625 [00:29<08:40, 1.16it/s]
Training 1/1 epoch (loss 3.2607): 4%|β | 23/625 [00:30<08:40, 1.16it/s]
Training 1/1 epoch (loss 3.2607): 4%|β | 24/625 [00:30<10:16, 1.03s/it]
Training 1/1 epoch (loss 3.0717): 4%|β | 24/625 [00:31<10:16, 1.03s/it]
Training 1/1 epoch (loss 3.0717): 4%|β | 25/625 [00:31<09:11, 1.09it/s]
Training 1/1 epoch (loss 3.4014): 4%|β | 25/625 [00:33<09:11, 1.09it/s]
Training 1/1 epoch (loss 3.4014): 4%|β | 26/625 [00:33<10:39, 1.07s/it]
Training 1/1 epoch (loss 2.8856): 4%|β | 26/625 [00:34<10:39, 1.07s/it]
Training 1/1 epoch (loss 2.8856): 4%|β | 27/625 [00:34<10:33, 1.06s/it]
Training 1/1 epoch (loss 3.3429): 4%|β | 27/625 [00:34<10:33, 1.06s/it]
Training 1/1 epoch (loss 3.3429): 4%|β | 28/625 [00:34<09:34, 1.04it/s]
Training 1/1 epoch (loss 3.1277): 4%|β | 28/625 [00:35<09:34, 1.04it/s]
Training 1/1 epoch (loss 3.1277): 5%|β | 29/625 [00:35<08:29, 1.17it/s]
Training 1/1 epoch (loss 3.2933): 5%|β | 29/625 [00:36<08:29, 1.17it/s]
Training 1/1 epoch (loss 3.2933): 5%|β | 30/625 [00:36<09:04, 1.09it/s]
Training 1/1 epoch (loss 3.0378): 5%|β | 30/625 [00:37<09:04, 1.09it/s]
Training 1/1 epoch (loss 3.0378): 5%|β | 31/625 [00:37<09:00, 1.10it/s]
Training 1/1 epoch (loss 3.3121): 5%|β | 31/625 [00:38<09:00, 1.10it/s]
Training 1/1 epoch (loss 3.3121): 5%|β | 32/625 [00:38<08:40, 1.14it/s]
Training 1/1 epoch (loss 3.4636): 5%|β | 32/625 [00:39<08:40, 1.14it/s]
Training 1/1 epoch (loss 3.4636): 5%|β | 33/625 [00:39<09:06, 1.08it/s]
Training 1/1 epoch (loss 3.2275): 5%|β | 33/625 [00:40<09:06, 1.08it/s]
Training 1/1 epoch (loss 3.2275): 5%|β | 34/625 [00:40<09:27, 1.04it/s]
Training 1/1 epoch (loss 3.1639): 5%|β | 34/625 [00:40<09:27, 1.04it/s]
Training 1/1 epoch (loss 3.1639): 6%|β | 35/625 [00:40<08:17, 1.19it/s]
Training 1/1 epoch (loss 3.1517): 6%|β | 35/625 [00:41<08:17, 1.19it/s]
Training 1/1 epoch (loss 3.1517): 6%|β | 36/625 [00:41<07:20, 1.34it/s]
Training 1/1 epoch (loss 3.5847): 6%|β | 36/625 [00:42<07:20, 1.34it/s]
Training 1/1 epoch (loss 3.5847): 6%|β | 37/625 [00:42<07:51, 1.25it/s]
Training 1/1 epoch (loss 3.1591): 6%|β | 37/625 [00:43<07:51, 1.25it/s]
Training 1/1 epoch (loss 3.1591): 6%|β | 38/625 [00:43<08:11, 1.19it/s]
Training 1/1 epoch (loss 3.5028): 6%|β | 38/625 [00:44<08:11, 1.19it/s]
Training 1/1 epoch (loss 3.5028): 6%|β | 39/625 [00:44<08:09, 1.20it/s]
Training 1/1 epoch (loss 3.2955): 6%|β | 39/625 [00:45<08:09, 1.20it/s]
Training 1/1 epoch (loss 3.2955): 6%|β | 40/625 [00:45<10:12, 1.05s/it]
Training 1/1 epoch (loss 3.4230): 6%|β | 40/625 [00:46<10:12, 1.05s/it]
Training 1/1 epoch (loss 3.4230): 7%|β | 41/625 [00:46<10:15, 1.05s/it]
Training 1/1 epoch (loss 3.4170): 7%|β | 41/625 [00:47<10:15, 1.05s/it]
Training 1/1 epoch (loss 3.4170): 7%|β | 42/625 [00:47<09:15, 1.05it/s]
Training 1/1 epoch (loss 3.2069): 7%|β | 42/625 [00:48<09:15, 1.05it/s]
Training 1/1 epoch (loss 3.2069): 7%|β | 43/625 [00:48<09:03, 1.07it/s]
Training 1/1 epoch (loss 3.2636): 7%|β | 43/625 [00:49<09:03, 1.07it/s]
Training 1/1 epoch (loss 3.2636): 7%|β | 44/625 [00:49<08:56, 1.08it/s]
Training 1/1 epoch (loss 3.4947): 7%|β | 44/625 [00:50<08:56, 1.08it/s]
Training 1/1 epoch (loss 3.4947): 7%|β | 45/625 [00:50<08:56, 1.08it/s]
Training 1/1 epoch (loss 3.1504): 7%|β | 45/625 [00:50<08:56, 1.08it/s]
Training 1/1 epoch (loss 3.1504): 7%|β | 46/625 [00:50<07:22, 1.31it/s]
Training 1/1 epoch (loss 3.1192): 7%|β | 46/625 [00:51<07:22, 1.31it/s]
Training 1/1 epoch (loss 3.1192): 8%|β | 47/625 [00:51<07:53, 1.22it/s]
Training 1/1 epoch (loss 3.0997): 8%|β | 47/625 [00:52<07:53, 1.22it/s]
Training 1/1 epoch (loss 3.0997): 8%|β | 48/625 [00:52<09:09, 1.05it/s]
Training 1/1 epoch (loss 3.5444): 8%|β | 48/625 [00:53<09:09, 1.05it/s]
Training 1/1 epoch (loss 3.5444): 8%|β | 49/625 [00:53<08:50, 1.09it/s]
Training 1/1 epoch (loss 3.5871): 8%|β | 49/625 [00:54<08:50, 1.09it/s]
Training 1/1 epoch (loss 3.5871): 8%|β | 50/625 [00:54<09:16, 1.03it/s]
Training 1/1 epoch (loss 3.4341): 8%|β | 50/625 [00:55<09:16, 1.03it/s]
Training 1/1 epoch (loss 3.4341): 8%|β | 51/625 [00:55<09:25, 1.01it/s]
Training 1/1 epoch (loss 3.2995): 8%|β | 51/625 [00:56<09:25, 1.01it/s]
Training 1/1 epoch (loss 3.2995): 8%|β | 52/625 [00:56<09:18, 1.03it/s]
Training 1/1 epoch (loss 3.0886): 8%|β | 52/625 [00:56<09:18, 1.03it/s]
Training 1/1 epoch (loss 3.0886): 8%|β | 53/625 [00:56<07:38, 1.25it/s]
Training 1/1 epoch (loss 2.9295): 8%|β | 53/625 [00:57<07:38, 1.25it/s]
Training 1/1 epoch (loss 2.9295): 9%|β | 54/625 [00:57<07:54, 1.20it/s]
Training 1/1 epoch (loss 3.4180): 9%|β | 54/625 [00:59<07:54, 1.20it/s]
Training 1/1 epoch (loss 3.4180): 9%|β | 55/625 [00:59<08:49, 1.08it/s]
Training 1/1 epoch (loss 3.2272): 9%|β | 55/625 [00:59<08:49, 1.08it/s]
Training 1/1 epoch (loss 3.2272): 9%|β | 56/625 [00:59<08:22, 1.13it/s]
Training 1/1 epoch (loss 3.2872): 9%|β | 56/625 [01:00<08:22, 1.13it/s]
Training 1/1 epoch (loss 3.2872): 9%|β | 57/625 [01:00<08:20, 1.14it/s]
Training 1/1 epoch (loss 3.2745): 9%|β | 57/625 [01:01<08:20, 1.14it/s]
Training 1/1 epoch (loss 3.2745): 9%|β | 58/625 [01:01<08:41, 1.09it/s]
Training 1/1 epoch (loss 3.0962): 9%|β | 58/625 [01:02<08:41, 1.09it/s]
Training 1/1 epoch (loss 3.0962): 9%|β | 59/625 [01:02<08:33, 1.10it/s]
Training 1/1 epoch (loss 3.1796): 9%|β | 59/625 [01:03<08:33, 1.10it/s]
Training 1/1 epoch (loss 3.1796): 10%|β | 60/625 [01:03<07:22, 1.28it/s]
Training 1/1 epoch (loss 3.1908): 10%|β | 60/625 [01:04<07:22, 1.28it/s]
Training 1/1 epoch (loss 3.1908): 10%|β | 61/625 [01:04<08:05, 1.16it/s]
Training 1/1 epoch (loss 3.3750): 10%|β | 61/625 [01:04<08:05, 1.16it/s]
Training 1/1 epoch (loss 3.3750): 10%|β | 62/625 [01:04<07:38, 1.23it/s]
Training 1/1 epoch (loss 3.2710): 10%|β | 62/625 [01:05<07:38, 1.23it/s]
Training 1/1 epoch (loss 3.2710): 10%|β | 63/625 [01:05<06:43, 1.39it/s]
Training 1/1 epoch (loss 3.1374): 10%|β | 63/625 [01:06<06:43, 1.39it/s]
Training 1/1 epoch (loss 3.1374): 10%|β | 64/625 [01:06<08:11, 1.14it/s]
Training 1/1 epoch (loss 3.0205): 10%|β | 64/625 [01:07<08:11, 1.14it/s]
Training 1/1 epoch (loss 3.0205): 10%|β | 65/625 [01:07<08:48, 1.06it/s]
Training 1/1 epoch (loss 3.3408): 10%|β | 65/625 [01:08<08:48, 1.06it/s]
Training 1/1 epoch (loss 3.3408): 11%|β | 66/625 [01:08<07:59, 1.17it/s]
Training 1/1 epoch (loss 3.3353): 11%|β | 66/625 [01:09<07:59, 1.17it/s]
Training 1/1 epoch (loss 3.3353): 11%|β | 67/625 [01:09<07:52, 1.18it/s]
Training 1/1 epoch (loss 3.1729): 11%|β | 67/625 [01:09<07:52, 1.18it/s]
Training 1/1 epoch (loss 3.1729): 11%|β | 68/625 [01:09<07:56, 1.17it/s]
Training 1/1 epoch (loss 3.4056): 11%|β | 68/625 [01:10<07:56, 1.17it/s]
Training 1/1 epoch (loss 3.4056): 11%|β | 69/625 [01:10<08:03, 1.15it/s]
Training 1/1 epoch (loss 3.1378): 11%|β | 69/625 [01:11<08:03, 1.15it/s]
Training 1/1 epoch (loss 3.1378): 11%|β | 70/625 [01:11<06:50, 1.35it/s]
Training 1/1 epoch (loss 3.2357): 11%|β | 70/625 [01:12<06:50, 1.35it/s]
Training 1/1 epoch (loss 3.2357): 11%|ββ | 71/625 [01:12<07:19, 1.26it/s]
Training 1/1 epoch (loss 3.0163): 11%|ββ | 71/625 [01:13<07:19, 1.26it/s]
Training 1/1 epoch (loss 3.0163): 12%|ββ | 72/625 [01:13<09:04, 1.02it/s]
Training 1/1 epoch (loss 3.1525): 12%|ββ | 72/625 [01:14<09:04, 1.02it/s]
Training 1/1 epoch (loss 3.1525): 12%|ββ | 73/625 [01:14<08:38, 1.06it/s]
Training 1/1 epoch (loss 3.3753): 12%|ββ | 73/625 [01:15<08:38, 1.06it/s]
Training 1/1 epoch (loss 3.3753): 12%|ββ | 74/625 [01:15<08:05, 1.13it/s]
Training 1/1 epoch (loss 3.0997): 12%|ββ | 74/625 [01:16<08:05, 1.13it/s]
Training 1/1 epoch (loss 3.0997): 12%|ββ | 75/625 [01:16<07:55, 1.16it/s]
Training 1/1 epoch (loss 3.2067): 12%|ββ | 75/625 [01:16<07:55, 1.16it/s]
Training 1/1 epoch (loss 3.2067): 12%|ββ | 76/625 [01:16<07:51, 1.16it/s]
Training 1/1 epoch (loss 3.2302): 12%|ββ | 76/625 [01:17<07:51, 1.16it/s]
Training 1/1 epoch (loss 3.2302): 12%|ββ | 77/625 [01:17<06:59, 1.31it/s]
Training 1/1 epoch (loss 3.2396): 12%|ββ | 77/625 [01:18<06:59, 1.31it/s]
Training 1/1 epoch (loss 3.2396): 12%|ββ | 78/625 [01:18<07:53, 1.16it/s]
Training 1/1 epoch (loss 3.3587): 12%|ββ | 78/625 [01:19<07:53, 1.16it/s]
Training 1/1 epoch (loss 3.3587): 13%|ββ | 79/625 [01:19<07:58, 1.14it/s]
Training 1/1 epoch (loss 3.2440): 13%|ββ | 79/625 [01:20<07:58, 1.14it/s]
Training 1/1 epoch (loss 3.2440): 13%|ββ | 80/625 [01:20<07:33, 1.20it/s]
Training 1/1 epoch (loss 3.3643): 13%|ββ | 80/625 [01:21<07:33, 1.20it/s]
Training 1/1 epoch (loss 3.3643): 13%|ββ | 81/625 [01:21<07:31, 1.21it/s]
Training 1/1 epoch (loss 3.1679): 13%|ββ | 81/625 [01:22<07:31, 1.21it/s]
Training 1/1 epoch (loss 3.1679): 13%|ββ | 82/625 [01:22<08:05, 1.12it/s]
Training 1/1 epoch (loss 3.2340): 13%|ββ | 82/625 [01:22<08:05, 1.12it/s]
Training 1/1 epoch (loss 3.2340): 13%|ββ | 83/625 [01:22<07:59, 1.13it/s]
Training 1/1 epoch (loss 2.9000): 13%|ββ | 83/625 [01:23<07:59, 1.13it/s]
Training 1/1 epoch (loss 2.9000): 13%|ββ | 84/625 [01:23<06:47, 1.33it/s]
Training 1/1 epoch (loss 3.3907): 13%|ββ | 84/625 [01:24<06:47, 1.33it/s]
Training 1/1 epoch (loss 3.3907): 14%|ββ | 85/625 [01:24<07:15, 1.24it/s]
Training 1/1 epoch (loss 3.4210): 14%|ββ | 85/625 [01:25<07:15, 1.24it/s]
Training 1/1 epoch (loss 3.4210): 14%|ββ | 86/625 [01:25<07:34, 1.19it/s]
Training 1/1 epoch (loss 3.3570): 14%|ββ | 86/625 [01:26<07:34, 1.19it/s]
Training 1/1 epoch (loss 3.3570): 14%|ββ | 87/625 [01:26<08:54, 1.01it/s]
Training 1/1 epoch (loss 3.4550): 14%|ββ | 87/625 [01:27<08:54, 1.01it/s]
Training 1/1 epoch (loss 3.4550): 14%|ββ | 88/625 [01:27<07:22, 1.21it/s]
Training 1/1 epoch (loss 3.4222): 14%|ββ | 88/625 [01:28<07:22, 1.21it/s]
Training 1/1 epoch (loss 3.4222): 14%|ββ | 89/625 [01:28<07:59, 1.12it/s]
Training 1/1 epoch (loss 3.2715): 14%|ββ | 89/625 [01:28<07:59, 1.12it/s]
Training 1/1 epoch (loss 3.2715): 14%|ββ | 90/625 [01:28<07:36, 1.17it/s]
Training 1/1 epoch (loss 3.1268): 14%|ββ | 90/625 [01:29<07:36, 1.17it/s]
Training 1/1 epoch (loss 3.1268): 15%|ββ | 91/625 [01:29<06:55, 1.28it/s]
Training 1/1 epoch (loss 3.1968): 15%|ββ | 91/625 [01:30<06:55, 1.28it/s]
Training 1/1 epoch (loss 3.1968): 15%|ββ | 92/625 [01:30<07:51, 1.13it/s]
Training 1/1 epoch (loss 3.1491): 15%|ββ | 92/625 [01:31<07:51, 1.13it/s]
Training 1/1 epoch (loss 3.1491): 15%|ββ | 93/625 [01:31<07:54, 1.12it/s]
Training 1/1 epoch (loss 3.3328): 15%|ββ | 93/625 [01:31<07:54, 1.12it/s]
Training 1/1 epoch (loss 3.3328): 15%|ββ | 94/625 [01:31<06:39, 1.33it/s]
Training 1/1 epoch (loss 3.4540): 15%|ββ | 94/625 [01:33<06:39, 1.33it/s]
Training 1/1 epoch (loss 3.4540): 15%|ββ | 95/625 [01:33<08:19, 1.06it/s]
Training 1/1 epoch (loss 2.9600): 15%|ββ | 95/625 [01:34<08:19, 1.06it/s]
Training 1/1 epoch (loss 2.9600): 15%|ββ | 96/625 [01:34<09:41, 1.10s/it]
Training 1/1 epoch (loss 3.2415): 15%|ββ | 96/625 [01:35<09:41, 1.10s/it]
Training 1/1 epoch (loss 3.2415): 16%|ββ | 97/625 [01:35<08:19, 1.06it/s]
Training 1/1 epoch (loss 3.2751): 16%|ββ | 97/625 [01:36<08:19, 1.06it/s]
Training 1/1 epoch (loss 3.2751): 16%|ββ | 98/625 [01:36<08:15, 1.06it/s]
Training 1/1 epoch (loss 3.3756): 16%|ββ | 98/625 [01:37<08:15, 1.06it/s]
Training 1/1 epoch (loss 3.3756): 16%|ββ | 99/625 [01:37<08:07, 1.08it/s]
Training 1/1 epoch (loss 3.2833): 16%|ββ | 99/625 [01:37<08:07, 1.08it/s]
Training 1/1 epoch (loss 3.2833): 16%|ββ | 100/625 [01:37<07:15, 1.21it/s]
Training 1/1 epoch (loss 3.2285): 16%|ββ | 100/625 [01:38<07:15, 1.21it/s]
Training 1/1 epoch (loss 3.2285): 16%|ββ | 101/625 [01:38<06:24, 1.36it/s]
Training 1/1 epoch (loss 3.3229): 16%|ββ | 101/625 [01:39<06:24, 1.36it/s]
Training 1/1 epoch (loss 3.3229): 16%|ββ | 102/625 [01:39<07:17, 1.20it/s]
Training 1/1 epoch (loss 3.2530): 16%|ββ | 102/625 [01:40<07:17, 1.20it/s]
Training 1/1 epoch (loss 3.2530): 16%|ββ | 103/625 [01:40<07:55, 1.10it/s]
Training 1/1 epoch (loss 3.2667): 16%|ββ | 103/625 [01:41<07:55, 1.10it/s]
Training 1/1 epoch (loss 3.2667): 17%|ββ | 104/625 [01:41<07:26, 1.17it/s]
Training 1/1 epoch (loss 3.1401): 17%|ββ | 104/625 [01:42<07:26, 1.17it/s]
Training 1/1 epoch (loss 3.1401): 17%|ββ | 105/625 [01:42<07:59, 1.09it/s]
Training 1/1 epoch (loss 3.0508): 17%|ββ | 105/625 [01:43<07:59, 1.09it/s]
Training 1/1 epoch (loss 3.0508): 17%|ββ | 106/625 [01:43<08:06, 1.07it/s]
Training 1/1 epoch (loss 3.5696): 17%|ββ | 106/625 [01:43<08:06, 1.07it/s]
Training 1/1 epoch (loss 3.5696): 17%|ββ | 107/625 [01:43<07:36, 1.13it/s]
Training 1/1 epoch (loss 3.2627): 17%|ββ | 107/625 [01:44<07:36, 1.13it/s]
Training 1/1 epoch (loss 3.2627): 17%|ββ | 108/625 [01:44<07:10, 1.20it/s]
Training 1/1 epoch (loss 3.1755): 17%|ββ | 108/625 [01:45<07:10, 1.20it/s]
Training 1/1 epoch (loss 3.1755): 17%|ββ | 109/625 [01:45<07:14, 1.19it/s]
Training 1/1 epoch (loss 3.4959): 17%|ββ | 109/625 [01:46<07:14, 1.19it/s]
Training 1/1 epoch (loss 3.4959): 18%|ββ | 110/625 [01:46<07:56, 1.08it/s]
Training 1/1 epoch (loss 3.4208): 18%|ββ | 110/625 [01:47<07:56, 1.08it/s]
Training 1/1 epoch (loss 3.4208): 18%|ββ | 111/625 [01:47<06:43, 1.27it/s]
Training 1/1 epoch (loss 3.3443): 18%|ββ | 111/625 [01:48<06:43, 1.27it/s]
Training 1/1 epoch (loss 3.3443): 18%|ββ | 112/625 [01:48<08:17, 1.03it/s]
Training 1/1 epoch (loss 3.1752): 18%|ββ | 112/625 [01:50<08:17, 1.03it/s]
Training 1/1 epoch (loss 3.1752): 18%|ββ | 113/625 [01:50<10:20, 1.21s/it]
Training 1/1 epoch (loss 3.2586): 18%|ββ | 113/625 [01:50<10:20, 1.21s/it]
Training 1/1 epoch (loss 3.2586): 18%|ββ | 114/625 [01:50<08:22, 1.02it/s]
Training 1/1 epoch (loss 3.3815): 18%|ββ | 114/625 [01:51<08:22, 1.02it/s]
Training 1/1 epoch (loss 3.3815): 18%|ββ | 115/625 [01:51<08:16, 1.03it/s]
Training 1/1 epoch (loss 3.4526): 18%|ββ | 115/625 [01:52<08:16, 1.03it/s]
Training 1/1 epoch (loss 3.4526): 19%|ββ | 116/625 [01:52<08:29, 1.00s/it]
Training 1/1 epoch (loss 3.4435): 19%|ββ | 116/625 [01:53<08:29, 1.00s/it]
Training 1/1 epoch (loss 3.4435): 19%|ββ | 117/625 [01:53<08:23, 1.01it/s]
Training 1/1 epoch (loss 2.9572): 19%|ββ | 117/625 [01:54<08:23, 1.01it/s]
Training 1/1 epoch (loss 2.9572): 19%|ββ | 118/625 [01:54<07:22, 1.14it/s]
Training 1/1 epoch (loss 3.0414): 19%|ββ | 118/625 [01:55<07:22, 1.14it/s]
Training 1/1 epoch (loss 3.0414): 19%|ββ | 119/625 [01:55<07:31, 1.12it/s]
Training 1/1 epoch (loss 3.1357): 19%|ββ | 119/625 [01:56<07:31, 1.12it/s]
Training 1/1 epoch (loss 3.1357): 19%|ββ | 120/625 [01:56<08:51, 1.05s/it]
Training 1/1 epoch (loss 3.0375): 19%|ββ | 120/625 [01:57<08:51, 1.05s/it]
Training 1/1 epoch (loss 3.0375): 19%|ββ | 121/625 [01:57<08:25, 1.00s/it]
Training 1/1 epoch (loss 3.1773): 19%|ββ | 121/625 [01:58<08:25, 1.00s/it]
Training 1/1 epoch (loss 3.1773): 20%|ββ | 122/625 [01:58<08:28, 1.01s/it]
Training 1/1 epoch (loss 3.0479): 20%|ββ | 122/625 [01:59<08:28, 1.01s/it]
Training 1/1 epoch (loss 3.0479): 20%|ββ | 123/625 [01:59<08:36, 1.03s/it]
Training 1/1 epoch (loss 3.3582): 20%|ββ | 123/625 [02:00<08:36, 1.03s/it]
Training 1/1 epoch (loss 3.3582): 20%|ββ | 124/625 [02:00<07:38, 1.09it/s]
Training 1/1 epoch (loss 3.0878): 20%|ββ | 124/625 [02:00<07:38, 1.09it/s]
Training 1/1 epoch (loss 3.0878): 20%|ββ | 125/625 [02:00<06:31, 1.28it/s]
Training 1/1 epoch (loss 3.3068): 20%|ββ | 125/625 [02:01<06:31, 1.28it/s]
Training 1/1 epoch (loss 3.3068): 20%|ββ | 126/625 [02:01<07:15, 1.15it/s]
Training 1/1 epoch (loss 3.1892): 20%|ββ | 126/625 [02:02<07:15, 1.15it/s]
Training 1/1 epoch (loss 3.1892): 20%|ββ | 127/625 [02:02<07:20, 1.13it/s]
Training 1/1 epoch (loss 3.1925): 20%|ββ | 127/625 [02:03<07:20, 1.13it/s]
Training 1/1 epoch (loss 3.1925): 20%|ββ | 128/625 [02:03<06:52, 1.21it/s]
Training 1/1 epoch (loss 2.9931): 20%|ββ | 128/625 [02:04<06:52, 1.21it/s]
Training 1/1 epoch (loss 2.9931): 21%|ββ | 129/625 [02:04<07:34, 1.09it/s]
Training 1/1 epoch (loss 3.2519): 21%|ββ | 129/625 [02:05<07:34, 1.09it/s]
Training 1/1 epoch (loss 3.2519): 21%|ββ | 130/625 [02:05<07:22, 1.12it/s]
Training 1/1 epoch (loss 2.9933): 21%|ββ | 130/625 [02:06<07:22, 1.12it/s]
Training 1/1 epoch (loss 2.9933): 21%|ββ | 131/625 [02:06<06:59, 1.18it/s]
Training 1/1 epoch (loss 3.3824): 21%|ββ | 131/625 [02:06<06:59, 1.18it/s]
Training 1/1 epoch (loss 3.3824): 21%|ββ | 132/625 [02:06<06:01, 1.36it/s]
Training 1/1 epoch (loss 3.2348): 21%|ββ | 132/625 [02:07<06:01, 1.36it/s]
Training 1/1 epoch (loss 3.2348): 21%|βββ | 133/625 [02:07<06:21, 1.29it/s]
Training 1/1 epoch (loss 3.1560): 21%|βββ | 133/625 [02:08<06:21, 1.29it/s]
Training 1/1 epoch (loss 3.1560): 21%|βββ | 134/625 [02:08<06:50, 1.20it/s]
Training 1/1 epoch (loss 3.2548): 21%|βββ | 134/625 [02:09<06:50, 1.20it/s]
Training 1/1 epoch (loss 3.2548): 22%|βββ | 135/625 [02:09<06:06, 1.34it/s]
Training 1/1 epoch (loss 3.2897): 22%|βββ | 135/625 [02:10<06:06, 1.34it/s]
Training 1/1 epoch (loss 3.2897): 22%|βββ | 136/625 [02:10<07:50, 1.04it/s]
Training 1/1 epoch (loss 3.2090): 22%|βββ | 136/625 [02:11<07:50, 1.04it/s]
Training 1/1 epoch (loss 3.2090): 22%|βββ | 137/625 [02:11<07:44, 1.05it/s]
Training 1/1 epoch (loss 3.1918): 22%|βββ | 137/625 [02:11<07:44, 1.05it/s]
Training 1/1 epoch (loss 3.1918): 22%|βββ | 138/625 [02:11<06:38, 1.22it/s]
Training 1/1 epoch (loss 3.2586): 22%|βββ | 138/625 [02:12<06:38, 1.22it/s]
Training 1/1 epoch (loss 3.2586): 22%|βββ | 139/625 [02:12<06:44, 1.20it/s]
Training 1/1 epoch (loss 3.0501): 22%|βββ | 139/625 [02:14<06:44, 1.20it/s]
Training 1/1 epoch (loss 3.0501): 22%|βββ | 140/625 [02:14<07:46, 1.04it/s]
Training 1/1 epoch (loss 3.0705): 22%|βββ | 140/625 [02:14<07:46, 1.04it/s]
Training 1/1 epoch (loss 3.0705): 23%|βββ | 141/625 [02:14<07:07, 1.13it/s]
Training 1/1 epoch (loss 3.1227): 23%|βββ | 141/625 [02:15<07:07, 1.13it/s]
Training 1/1 epoch (loss 3.1227): 23%|βββ | 142/625 [02:15<07:26, 1.08it/s]
Training 1/1 epoch (loss 3.3186): 23%|βββ | 142/625 [02:16<07:26, 1.08it/s]
Training 1/1 epoch (loss 3.3186): 23%|βββ | 143/625 [02:16<07:41, 1.05it/s]
Training 1/1 epoch (loss 3.2161): 23%|βββ | 143/625 [02:17<07:41, 1.05it/s]
Training 1/1 epoch (loss 3.2161): 23%|βββ | 144/625 [02:17<07:02, 1.14it/s]
Training 1/1 epoch (loss 3.4694): 23%|βββ | 144/625 [02:18<07:02, 1.14it/s]
Training 1/1 epoch (loss 3.4694): 23%|βββ | 145/625 [02:18<06:28, 1.23it/s]
Training 1/1 epoch (loss 3.2777): 23%|βββ | 145/625 [02:18<06:28, 1.23it/s]
Training 1/1 epoch (loss 3.2777): 23%|βββ | 146/625 [02:18<06:26, 1.24it/s]
Training 1/1 epoch (loss 3.0896): 23%|βββ | 146/625 [02:19<06:26, 1.24it/s]
Training 1/1 epoch (loss 3.0896): 24%|βββ | 147/625 [02:19<06:46, 1.18it/s]
Training 1/1 epoch (loss 3.1548): 24%|βββ | 147/625 [02:20<06:46, 1.18it/s]
Training 1/1 epoch (loss 3.1548): 24%|βββ | 148/625 [02:20<07:04, 1.12it/s]
Training 1/1 epoch (loss 3.2844): 24%|βββ | 148/625 [02:21<07:04, 1.12it/s]
Training 1/1 epoch (loss 3.2844): 24%|βββ | 149/625 [02:21<07:02, 1.13it/s]
Training 1/1 epoch (loss 3.2683): 24%|βββ | 149/625 [02:22<07:02, 1.13it/s]
Training 1/1 epoch (loss 3.2683): 24%|βββ | 150/625 [02:22<06:56, 1.14it/s]
Training 1/1 epoch (loss 3.3044): 24%|βββ | 150/625 [02:23<06:56, 1.14it/s]
Training 1/1 epoch (loss 3.3044): 24%|βββ | 151/625 [02:23<07:12, 1.10it/s]
Training 1/1 epoch (loss 3.2414): 24%|βββ | 151/625 [02:24<07:12, 1.10it/s]
Training 1/1 epoch (loss 3.2414): 24%|βββ | 152/625 [02:24<07:32, 1.05it/s]
Training 1/1 epoch (loss 3.1771): 24%|βββ | 152/625 [02:25<07:32, 1.05it/s]
Training 1/1 epoch (loss 3.1771): 24%|βββ | 153/625 [02:25<07:47, 1.01it/s]
Training 1/1 epoch (loss 3.3854): 24%|βββ | 153/625 [02:26<07:47, 1.01it/s]
Training 1/1 epoch (loss 3.3854): 25%|βββ | 154/625 [02:26<07:30, 1.05it/s]
Training 1/1 epoch (loss 3.3595): 25%|βββ | 154/625 [02:27<07:30, 1.05it/s]
Training 1/1 epoch (loss 3.3595): 25%|βββ | 155/625 [02:27<06:44, 1.16it/s]
Training 1/1 epoch (loss 3.2299): 25%|βββ | 155/625 [02:28<06:44, 1.16it/s]
Training 1/1 epoch (loss 3.2299): 25%|βββ | 156/625 [02:28<07:12, 1.09it/s]
Training 1/1 epoch (loss 3.3763): 25%|βββ | 156/625 [02:29<07:12, 1.09it/s]
Training 1/1 epoch (loss 3.3763): 25%|βββ | 157/625 [02:29<07:18, 1.07it/s]
Training 1/1 epoch (loss 3.2245): 25%|βββ | 157/625 [02:29<07:18, 1.07it/s]
Training 1/1 epoch (loss 3.2245): 25%|βββ | 158/625 [02:29<06:26, 1.21it/s]
Training 1/1 epoch (loss 3.2281): 25%|βββ | 158/625 [02:31<06:26, 1.21it/s]
Training 1/1 epoch (loss 3.2281): 25%|βββ | 159/625 [02:31<07:52, 1.01s/it]
Training 1/1 epoch (loss 2.9881): 25%|βββ | 159/625 [02:32<07:52, 1.01s/it]
Training 1/1 epoch (loss 2.9881): 26%|βββ | 160/625 [02:32<08:50, 1.14s/it]
Training 1/1 epoch (loss 3.1837): 26%|βββ | 160/625 [02:33<08:50, 1.14s/it]
Training 1/1 epoch (loss 3.1837): 26%|βββ | 161/625 [02:33<08:13, 1.06s/it]
Training 1/1 epoch (loss 3.2173): 26%|βββ | 161/625 [02:34<08:13, 1.06s/it]
Training 1/1 epoch (loss 3.2173): 26%|βββ | 162/625 [02:34<08:12, 1.06s/it]
Training 1/1 epoch (loss 3.2664): 26%|βββ | 162/625 [02:35<08:12, 1.06s/it]
Training 1/1 epoch (loss 3.2664): 26%|βββ | 163/625 [02:35<07:41, 1.00it/s]
Training 1/1 epoch (loss 3.3103): 26%|βββ | 163/625 [02:36<07:41, 1.00it/s]
Training 1/1 epoch (loss 3.3103): 26%|βββ | 164/625 [02:36<06:35, 1.16it/s]
Training 1/1 epoch (loss 3.2314): 26%|βββ | 164/625 [02:37<06:35, 1.16it/s]
Training 1/1 epoch (loss 3.2314): 26%|βββ | 165/625 [02:37<07:03, 1.09it/s]
Training 1/1 epoch (loss 3.1863): 26%|βββ | 165/625 [02:38<07:03, 1.09it/s]
Training 1/1 epoch (loss 3.1863): 27%|βββ | 166/625 [02:38<06:57, 1.10it/s]
Training 1/1 epoch (loss 3.0153): 27%|βββ | 166/625 [02:39<06:57, 1.10it/s]
Training 1/1 epoch (loss 3.0153): 27%|βββ | 167/625 [02:39<08:17, 1.09s/it]
Training 1/1 epoch (loss 3.2072): 27%|βββ | 167/625 [02:40<08:17, 1.09s/it]
Training 1/1 epoch (loss 3.2072): 27%|βββ | 168/625 [02:40<08:31, 1.12s/it]
Training 1/1 epoch (loss 2.9786): 27%|βββ | 168/625 [02:41<08:31, 1.12s/it]
Training 1/1 epoch (loss 2.9786): 27%|βββ | 169/625 [02:41<08:06, 1.07s/it]
Training 1/1 epoch (loss 3.3027): 27%|βββ | 169/625 [02:42<08:06, 1.07s/it]
Training 1/1 epoch (loss 3.3027): 27%|βββ | 170/625 [02:42<06:43, 1.13it/s]
Training 1/1 epoch (loss 3.1574): 27%|βββ | 170/625 [02:42<06:43, 1.13it/s]
Training 1/1 epoch (loss 3.1574): 27%|βββ | 171/625 [02:42<05:26, 1.39it/s]
Training 1/1 epoch (loss 3.4086): 27%|βββ | 171/625 [02:42<05:26, 1.39it/s]
Training 1/1 epoch (loss 3.4086): 28%|βββ | 172/625 [02:42<04:34, 1.65it/s]
Training 1/1 epoch (loss 3.1260): 28%|βββ | 172/625 [02:43<04:34, 1.65it/s]
Training 1/1 epoch (loss 3.1260): 28%|βββ | 173/625 [02:43<03:58, 1.90it/s]
Training 1/1 epoch (loss 3.0587): 28%|βββ | 173/625 [02:43<03:58, 1.90it/s]
Training 1/1 epoch (loss 3.0587): 28%|βββ | 174/625 [02:43<03:50, 1.95it/s]
Training 1/1 epoch (loss 3.3035): 28%|βββ | 174/625 [02:44<03:50, 1.95it/s]
Training 1/1 epoch (loss 3.3035): 28%|βββ | 175/625 [02:44<03:34, 2.10it/s]
Training 1/1 epoch (loss 3.0606): 28%|βββ | 175/625 [02:44<03:34, 2.10it/s]
Training 1/1 epoch (loss 3.0606): 28%|βββ | 176/625 [02:44<03:25, 2.18it/s]
Training 1/1 epoch (loss 3.1408): 28%|βββ | 176/625 [02:44<03:25, 2.18it/s]
Training 1/1 epoch (loss 3.1408): 28%|βββ | 177/625 [02:44<03:09, 2.37it/s]
Training 1/1 epoch (loss 3.0860): 28%|βββ | 177/625 [02:45<03:09, 2.37it/s]
Training 1/1 epoch (loss 3.0860): 28%|βββ | 178/625 [02:45<03:13, 2.31it/s]
Training 1/1 epoch (loss 3.0323): 28%|βββ | 178/625 [02:45<03:13, 2.31it/s]
Training 1/1 epoch (loss 3.0323): 29%|βββ | 179/625 [02:45<03:12, 2.31it/s]
Training 1/1 epoch (loss 3.1318): 29%|βββ | 179/625 [02:46<03:12, 2.31it/s]
Training 1/1 epoch (loss 3.1318): 29%|βββ | 180/625 [02:46<03:03, 2.43it/s]
Training 1/1 epoch (loss 3.2516): 29%|βββ | 180/625 [02:46<03:03, 2.43it/s]
Training 1/1 epoch (loss 3.2516): 29%|βββ | 181/625 [02:46<02:59, 2.48it/s]
Training 1/1 epoch (loss 3.1982): 29%|βββ | 181/625 [02:46<02:59, 2.48it/s]
Training 1/1 epoch (loss 3.1982): 29%|βββ | 182/625 [02:46<02:49, 2.62it/s]
Training 1/1 epoch (loss 3.4200): 29%|βββ | 182/625 [02:47<02:49, 2.62it/s]
Training 1/1 epoch (loss 3.4200): 29%|βββ | 183/625 [02:47<03:20, 2.20it/s]
Training 1/1 epoch (loss 3.1094): 29%|βββ | 183/625 [02:47<03:20, 2.20it/s]
Training 1/1 epoch (loss 3.1094): 29%|βββ | 184/625 [02:47<03:09, 2.33it/s]
Training 1/1 epoch (loss 3.0059): 29%|βββ | 184/625 [02:48<03:09, 2.33it/s]
Training 1/1 epoch (loss 3.0059): 30%|βββ | 185/625 [02:48<03:46, 1.94it/s]
Training 1/1 epoch (loss 2.9825): 30%|βββ | 185/625 [02:48<03:46, 1.94it/s]
Training 1/1 epoch (loss 2.9825): 30%|βββ | 186/625 [02:48<03:25, 2.13it/s]
Training 1/1 epoch (loss 3.2814): 30%|βββ | 186/625 [02:49<03:25, 2.13it/s]
Training 1/1 epoch (loss 3.2814): 30%|βββ | 187/625 [02:49<03:08, 2.33it/s]
Training 1/1 epoch (loss 2.9368): 30%|βββ | 187/625 [02:49<03:08, 2.33it/s]
Training 1/1 epoch (loss 2.9368): 30%|βββ | 188/625 [02:49<02:55, 2.49it/s]
Training 1/1 epoch (loss 3.1348): 30%|βββ | 188/625 [02:49<02:55, 2.49it/s]
Training 1/1 epoch (loss 3.1348): 30%|βββ | 189/625 [02:49<02:47, 2.60it/s]
Training 1/1 epoch (loss 3.4423): 30%|βββ | 189/625 [02:50<02:47, 2.60it/s]
Training 1/1 epoch (loss 3.4423): 30%|βββ | 190/625 [02:50<02:46, 2.62it/s]
Training 1/1 epoch (loss 3.1422): 30%|βββ | 190/625 [02:50<02:46, 2.62it/s]
Training 1/1 epoch (loss 3.1422): 31%|βββ | 191/625 [02:50<02:48, 2.58it/s]
Training 1/1 epoch (loss 3.2104): 31%|βββ | 191/625 [02:51<02:48, 2.58it/s]
Training 1/1 epoch (loss 3.2104): 31%|βββ | 192/625 [02:51<02:49, 2.56it/s]
Training 1/1 epoch (loss 3.0049): 31%|βββ | 192/625 [02:51<02:49, 2.56it/s]
Training 1/1 epoch (loss 3.0049): 31%|βββ | 193/625 [02:51<02:43, 2.63it/s]
Training 1/1 epoch (loss 3.1979): 31%|βββ | 193/625 [02:51<02:43, 2.63it/s]
Training 1/1 epoch (loss 3.1979): 31%|βββ | 194/625 [02:51<02:40, 2.68it/s]
Training 1/1 epoch (loss 3.0139): 31%|βββ | 194/625 [02:52<02:40, 2.68it/s]
Training 1/1 epoch (loss 3.0139): 31%|βββ | 195/625 [02:52<02:39, 2.70it/s]
Training 1/1 epoch (loss 3.5289): 31%|βββ | 195/625 [02:52<02:39, 2.70it/s]
Training 1/1 epoch (loss 3.5289): 31%|ββββ | 196/625 [02:52<02:41, 2.66it/s]
Training 1/1 epoch (loss 3.3669): 31%|ββββ | 196/625 [02:52<02:41, 2.66it/s]
Training 1/1 epoch (loss 3.3669): 32%|ββββ | 197/625 [02:52<02:48, 2.55it/s]
Training 1/1 epoch (loss 3.1781): 32%|ββββ | 197/625 [02:53<02:48, 2.55it/s]
Training 1/1 epoch (loss 3.1781): 32%|ββββ | 198/625 [02:53<02:40, 2.67it/s]
Training 1/1 epoch (loss 3.1821): 32%|ββββ | 198/625 [02:53<02:40, 2.67it/s]
Training 1/1 epoch (loss 3.1821): 32%|ββββ | 199/625 [02:53<02:34, 2.75it/s]
Training 1/1 epoch (loss 3.1466): 32%|ββββ | 199/625 [02:53<02:34, 2.75it/s]
Training 1/1 epoch (loss 3.1466): 32%|ββββ | 200/625 [02:53<02:31, 2.81it/s]
Training 1/1 epoch (loss 3.3506): 32%|ββββ | 200/625 [02:54<02:31, 2.81it/s]
Training 1/1 epoch (loss 3.3506): 32%|ββββ | 201/625 [02:54<02:51, 2.48it/s]
Training 1/1 epoch (loss 3.1460): 32%|ββββ | 201/625 [02:54<02:51, 2.48it/s]
Training 1/1 epoch (loss 3.1460): 32%|ββββ | 202/625 [02:54<02:40, 2.63it/s]
Training 1/1 epoch (loss 3.1813): 32%|ββββ | 202/625 [02:55<02:40, 2.63it/s]
Training 1/1 epoch (loss 3.1813): 32%|ββββ | 203/625 [02:55<02:35, 2.71it/s]
Training 1/1 epoch (loss 3.2970): 32%|ββββ | 203/625 [02:55<02:35, 2.71it/s]
Training 1/1 epoch (loss 3.2970): 33%|ββββ | 204/625 [02:55<02:28, 2.83it/s]
Training 1/1 epoch (loss 3.2643): 33%|ββββ | 204/625 [02:55<02:28, 2.83it/s]
Training 1/1 epoch (loss 3.2643): 33%|ββββ | 205/625 [02:55<02:25, 2.88it/s]
Training 1/1 epoch (loss 3.4331): 33%|ββββ | 205/625 [02:56<02:25, 2.88it/s]
Training 1/1 epoch (loss 3.4331): 33%|ββββ | 206/625 [02:56<02:24, 2.90it/s]
Training 1/1 epoch (loss 2.9393): 33%|ββββ | 206/625 [02:56<02:24, 2.90it/s]
Training 1/1 epoch (loss 2.9393): 33%|ββββ | 207/625 [02:56<02:31, 2.76it/s]
Training 1/1 epoch (loss 3.1915): 33%|ββββ | 207/625 [02:56<02:31, 2.76it/s]
Training 1/1 epoch (loss 3.1915): 33%|ββββ | 208/625 [02:56<02:31, 2.74it/s]
Training 1/1 epoch (loss 2.9711): 33%|ββββ | 208/625 [02:57<02:31, 2.74it/s]
Training 1/1 epoch (loss 2.9711): 33%|ββββ | 209/625 [02:57<02:28, 2.80it/s]
Training 1/1 epoch (loss 3.1610): 33%|ββββ | 209/625 [02:57<02:28, 2.80it/s]
Training 1/1 epoch (loss 3.1610): 34%|ββββ | 210/625 [02:57<02:28, 2.79it/s]
Training 1/1 epoch (loss 3.3766): 34%|ββββ | 210/625 [02:57<02:28, 2.79it/s]
Training 1/1 epoch (loss 3.3766): 34%|ββββ | 211/625 [02:57<02:27, 2.81it/s]
Training 1/1 epoch (loss 3.2696): 34%|ββββ | 211/625 [02:58<02:27, 2.81it/s]
Training 1/1 epoch (loss 3.2696): 34%|ββββ | 212/625 [02:58<02:29, 2.76it/s]
Training 1/1 epoch (loss 3.4566): 34%|ββββ | 212/625 [02:58<02:29, 2.76it/s]
Training 1/1 epoch (loss 3.4566): 34%|ββββ | 213/625 [02:58<02:50, 2.42it/s]
Training 1/1 epoch (loss 3.3696): 34%|ββββ | 213/625 [02:59<02:50, 2.42it/s]
Training 1/1 epoch (loss 3.3696): 34%|ββββ | 214/625 [02:59<02:47, 2.46it/s]
Training 1/1 epoch (loss 2.9788): 34%|ββββ | 214/625 [02:59<02:47, 2.46it/s]
Training 1/1 epoch (loss 2.9788): 34%|ββββ | 215/625 [02:59<02:48, 2.43it/s]
Training 1/1 epoch (loss 2.9146): 34%|ββββ | 215/625 [03:00<02:48, 2.43it/s]
Training 1/1 epoch (loss 2.9146): 35%|ββββ | 216/625 [03:00<02:53, 2.35it/s]
Training 1/1 epoch (loss 3.2574): 35%|ββββ | 216/625 [03:00<02:53, 2.35it/s]
Training 1/1 epoch (loss 3.2574): 35%|ββββ | 217/625 [03:00<02:55, 2.33it/s]
Training 1/1 epoch (loss 3.0125): 35%|ββββ | 217/625 [03:00<02:55, 2.33it/s]
Training 1/1 epoch (loss 3.0125): 35%|ββββ | 218/625 [03:00<02:43, 2.49it/s]
Training 1/1 epoch (loss 3.4519): 35%|ββββ | 218/625 [03:01<02:43, 2.49it/s]
Training 1/1 epoch (loss 3.4519): 35%|ββββ | 219/625 [03:01<02:36, 2.59it/s]
Training 1/1 epoch (loss 3.3739): 35%|ββββ | 219/625 [03:01<02:36, 2.59it/s]
Training 1/1 epoch (loss 3.3739): 35%|ββββ | 220/625 [03:01<02:34, 2.62it/s]
Training 1/1 epoch (loss 3.3418): 35%|ββββ | 220/625 [03:01<02:34, 2.62it/s]
Training 1/1 epoch (loss 3.3418): 35%|ββββ | 221/625 [03:01<02:30, 2.69it/s]
Training 1/1 epoch (loss 3.3660): 35%|ββββ | 221/625 [03:02<02:30, 2.69it/s]
Training 1/1 epoch (loss 3.3660): 36%|ββββ | 222/625 [03:02<02:34, 2.61it/s]
Training 1/1 epoch (loss 3.4154): 36%|ββββ | 222/625 [03:02<02:34, 2.61it/s]
Training 1/1 epoch (loss 3.4154): 36%|ββββ | 223/625 [03:02<02:36, 2.57it/s]
Training 1/1 epoch (loss 3.2738): 36%|ββββ | 223/625 [03:03<02:36, 2.57it/s]
Training 1/1 epoch (loss 3.2738): 36%|ββββ | 224/625 [03:03<02:48, 2.38it/s]
Training 1/1 epoch (loss 3.2692): 36%|ββββ | 224/625 [03:03<02:48, 2.38it/s]
Training 1/1 epoch (loss 3.2692): 36%|ββββ | 225/625 [03:03<02:46, 2.40it/s]
Training 1/1 epoch (loss 3.4553): 36%|ββββ | 225/625 [03:04<02:46, 2.40it/s]
Training 1/1 epoch (loss 3.4553): 36%|ββββ | 226/625 [03:04<02:56, 2.26it/s]
Training 1/1 epoch (loss 3.1175): 36%|ββββ | 226/625 [03:04<02:56, 2.26it/s]
Training 1/1 epoch (loss 3.1175): 36%|ββββ | 227/625 [03:04<03:10, 2.09it/s]
Training 1/1 epoch (loss 2.9826): 36%|ββββ | 227/625 [03:06<03:10, 2.09it/s]
Training 1/1 epoch (loss 2.9826): 36%|ββββ | 228/625 [03:06<05:17, 1.25it/s]
Training 1/1 epoch (loss 3.2014): 36%|ββββ | 228/625 [03:06<05:17, 1.25it/s]
Training 1/1 epoch (loss 3.2014): 37%|ββββ | 229/625 [03:06<04:31, 1.46it/s]
Training 1/1 epoch (loss 3.2336): 37%|ββββ | 229/625 [03:07<04:31, 1.46it/s]
Training 1/1 epoch (loss 3.2336): 37%|ββββ | 230/625 [03:07<03:51, 1.71it/s]
Training 1/1 epoch (loss 3.2794): 37%|ββββ | 230/625 [03:07<03:51, 1.71it/s]
Training 1/1 epoch (loss 3.2794): 37%|ββββ | 231/625 [03:07<03:27, 1.89it/s]
Training 1/1 epoch (loss 3.4170): 37%|ββββ | 231/625 [03:07<03:27, 1.89it/s]
Training 1/1 epoch (loss 3.4170): 37%|ββββ | 232/625 [03:07<03:13, 2.03it/s]
Training 1/1 epoch (loss 3.1187): 37%|ββββ | 232/625 [03:08<03:13, 2.03it/s]
Training 1/1 epoch (loss 3.1187): 37%|ββββ | 233/625 [03:08<03:24, 1.92it/s]
Training 1/1 epoch (loss 3.1473): 37%|ββββ | 233/625 [03:08<03:24, 1.92it/s]
Training 1/1 epoch (loss 3.1473): 37%|ββββ | 234/625 [03:08<03:14, 2.01it/s]
Training 1/1 epoch (loss 3.1678): 37%|ββββ | 234/625 [03:09<03:14, 2.01it/s]
Training 1/1 epoch (loss 3.1678): 38%|ββββ | 235/625 [03:09<03:05, 2.10it/s]
Training 1/1 epoch (loss 3.2566): 38%|ββββ | 235/625 [03:09<03:05, 2.10it/s]
Training 1/1 epoch (loss 3.2566): 38%|ββββ | 236/625 [03:09<02:57, 2.19it/s]
Training 1/1 epoch (loss 3.1806): 38%|ββββ | 236/625 [03:10<02:57, 2.19it/s]
Training 1/1 epoch (loss 3.1806): 38%|ββββ | 237/625 [03:10<02:54, 2.22it/s]
Training 1/1 epoch (loss 3.2215): 38%|ββββ | 237/625 [03:10<02:54, 2.22it/s]
Training 1/1 epoch (loss 3.2215): 38%|ββββ | 238/625 [03:10<02:56, 2.19it/s]
Training 1/1 epoch (loss 3.1663): 38%|ββββ | 238/625 [03:11<02:56, 2.19it/s]
Training 1/1 epoch (loss 3.1663): 38%|ββββ | 239/625 [03:11<04:23, 1.47it/s]
Training 1/1 epoch (loss 3.0495): 38%|ββββ | 239/625 [03:13<04:23, 1.47it/s]
Training 1/1 epoch (loss 3.0495): 38%|ββββ | 240/625 [03:13<05:17, 1.21it/s]
Training 1/1 epoch (loss 3.0159): 38%|ββββ | 240/625 [03:13<05:17, 1.21it/s]
Training 1/1 epoch (loss 3.0159): 39%|ββββ | 241/625 [03:13<05:09, 1.24it/s]
Training 1/1 epoch (loss 3.5519): 39%|ββββ | 241/625 [03:16<05:09, 1.24it/s]
Training 1/1 epoch (loss 3.5519): 39%|ββββ | 242/625 [03:16<08:51, 1.39s/it]
Training 1/1 epoch (loss 3.3440): 39%|ββββ | 242/625 [03:17<08:51, 1.39s/it]
Training 1/1 epoch (loss 3.3440): 39%|ββββ | 243/625 [03:17<08:06, 1.27s/it]
Training 1/1 epoch (loss 3.1673): 39%|ββββ | 243/625 [03:18<08:06, 1.27s/it]
Training 1/1 epoch (loss 3.1673): 39%|ββββ | 244/625 [03:18<07:00, 1.10s/it]
Training 1/1 epoch (loss 3.3256): 39%|ββββ | 244/625 [03:18<07:00, 1.10s/it]
Training 1/1 epoch (loss 3.3256): 39%|ββββ | 245/625 [03:18<05:57, 1.06it/s]
Training 1/1 epoch (loss 3.2604): 39%|ββββ | 245/625 [03:19<05:57, 1.06it/s]
Training 1/1 epoch (loss 3.2604): 39%|ββββ | 246/625 [03:19<05:20, 1.18it/s]
Training 1/1 epoch (loss 3.2147): 39%|ββββ | 246/625 [03:19<05:20, 1.18it/s]
Training 1/1 epoch (loss 3.2147): 40%|ββββ | 247/625 [03:19<04:40, 1.35it/s]
Training 1/1 epoch (loss 3.0223): 40%|ββββ | 247/625 [03:20<04:40, 1.35it/s]
Training 1/1 epoch (loss 3.0223): 40%|ββββ | 248/625 [03:20<04:31, 1.39it/s]
Training 1/1 epoch (loss 3.0372): 40%|ββββ | 248/625 [03:21<04:31, 1.39it/s]
Training 1/1 epoch (loss 3.0372): 40%|ββββ | 249/625 [03:21<04:17, 1.46it/s]
Training 1/1 epoch (loss 3.5042): 40%|ββββ | 249/625 [03:21<04:17, 1.46it/s]
Training 1/1 epoch (loss 3.5042): 40%|ββββ | 250/625 [03:21<04:19, 1.45it/s]
Training 1/1 epoch (loss 3.0854): 40%|ββββ | 250/625 [03:22<04:19, 1.45it/s]
Training 1/1 epoch (loss 3.0854): 40%|ββββ | 251/625 [03:22<04:16, 1.46it/s]
Training 1/1 epoch (loss 3.4528): 40%|ββββ | 251/625 [03:23<04:16, 1.46it/s]
Training 1/1 epoch (loss 3.4528): 40%|ββββ | 252/625 [03:23<04:50, 1.29it/s]
Training 1/1 epoch (loss 3.2099): 40%|ββββ | 252/625 [03:24<04:50, 1.29it/s]
Training 1/1 epoch (loss 3.2099): 40%|ββββ | 253/625 [03:24<05:06, 1.21it/s]
Training 1/1 epoch (loss 2.9796): 40%|ββββ | 253/625 [03:25<05:06, 1.21it/s]
Training 1/1 epoch (loss 2.9796): 41%|ββββ | 254/625 [03:25<05:18, 1.16it/s]
Training 1/1 epoch (loss 3.3242): 41%|ββββ | 254/625 [03:26<05:18, 1.16it/s]
Training 1/1 epoch (loss 3.3242): 41%|ββββ | 255/625 [03:26<05:25, 1.14it/s]
Training 1/1 epoch (loss 3.2312): 41%|ββββ | 255/625 [03:27<05:25, 1.14it/s]
Training 1/1 epoch (loss 3.2312): 41%|ββββ | 256/625 [03:27<06:25, 1.05s/it]
Training 1/1 epoch (loss 3.1103): 41%|ββββ | 256/625 [03:28<06:25, 1.05s/it]
Training 1/1 epoch (loss 3.1103): 41%|ββββ | 257/625 [03:28<06:29, 1.06s/it]
Training 1/1 epoch (loss 3.1216): 41%|ββββ | 257/625 [03:29<06:29, 1.06s/it]
Training 1/1 epoch (loss 3.1216): 41%|βββββ | 258/625 [03:29<06:11, 1.01s/it]
Training 1/1 epoch (loss 3.3977): 41%|βββββ | 258/625 [03:30<06:11, 1.01s/it]
Training 1/1 epoch (loss 3.3977): 41%|βββββ | 259/625 [03:30<05:24, 1.13it/s]
Training 1/1 epoch (loss 3.2104): 41%|βββββ | 259/625 [03:30<05:24, 1.13it/s]
Training 1/1 epoch (loss 3.2104): 42%|βββββ | 260/625 [03:30<04:50, 1.25it/s]
Training 1/1 epoch (loss 3.1461): 42%|βββββ | 260/625 [03:31<04:50, 1.25it/s]
Training 1/1 epoch (loss 3.1461): 42%|βββββ | 261/625 [03:31<04:30, 1.35it/s]
Training 1/1 epoch (loss 3.0574): 42%|βββββ | 261/625 [03:32<04:30, 1.35it/s]
Training 1/1 epoch (loss 3.0574): 42%|βββββ | 262/625 [03:32<04:13, 1.43it/s]
Training 1/1 epoch (loss 3.1022): 42%|βββββ | 262/625 [03:32<04:13, 1.43it/s]
Training 1/1 epoch (loss 3.1022): 42%|βββββ | 263/625 [03:32<03:36, 1.67it/s]
Training 1/1 epoch (loss 2.9490): 42%|βββββ | 263/625 [03:32<03:36, 1.67it/s]
Training 1/1 epoch (loss 2.9490): 42%|βββββ | 264/625 [03:32<03:12, 1.87it/s]
Training 1/1 epoch (loss 3.0977): 42%|βββββ | 264/625 [03:33<03:12, 1.87it/s]
Training 1/1 epoch (loss 3.0977): 42%|βββββ | 265/625 [03:33<02:48, 2.13it/s]
Training 1/1 epoch (loss 3.0430): 42%|βββββ | 265/625 [03:33<02:48, 2.13it/s]
Training 1/1 epoch (loss 3.0430): 43%|βββββ | 266/625 [03:33<02:34, 2.32it/s]
Training 1/1 epoch (loss 3.2400): 43%|βββββ | 266/625 [03:33<02:34, 2.32it/s]
Training 1/1 epoch (loss 3.2400): 43%|βββββ | 267/625 [03:33<02:23, 2.50it/s]
Training 1/1 epoch (loss 3.2094): 43%|βββββ | 267/625 [03:34<02:23, 2.50it/s]
Training 1/1 epoch (loss 3.2094): 43%|βββββ | 268/625 [03:34<03:10, 1.88it/s]
Training 1/1 epoch (loss 3.2616): 43%|βββββ | 268/625 [03:35<03:10, 1.88it/s]
Training 1/1 epoch (loss 3.2616): 43%|βββββ | 269/625 [03:35<03:43, 1.59it/s]
Training 1/1 epoch (loss 3.1367): 43%|βββββ | 269/625 [03:36<03:43, 1.59it/s]
Training 1/1 epoch (loss 3.1367): 43%|βββββ | 270/625 [03:36<04:42, 1.26it/s]
Training 1/1 epoch (loss 3.3698): 43%|βββββ | 270/625 [03:37<04:42, 1.26it/s]
Training 1/1 epoch (loss 3.3698): 43%|βββββ | 271/625 [03:37<04:41, 1.26it/s]
Training 1/1 epoch (loss 3.2125): 43%|βββββ | 271/625 [03:39<04:41, 1.26it/s]
Training 1/1 epoch (loss 3.2125): 44%|βββββ | 272/625 [03:39<05:56, 1.01s/it]
Training 1/1 epoch (loss 3.2749): 44%|βββββ | 272/625 [03:40<05:56, 1.01s/it]
Training 1/1 epoch (loss 3.2749): 44%|βββββ | 273/625 [03:40<05:54, 1.01s/it]
Training 1/1 epoch (loss 3.0599): 44%|βββββ | 273/625 [03:40<05:54, 1.01s/it]
Training 1/1 epoch (loss 3.0599): 44%|βββββ | 274/625 [03:40<05:39, 1.03it/s]
Training 1/1 epoch (loss 2.8943): 44%|βββββ | 274/625 [03:42<05:39, 1.03it/s]
Training 1/1 epoch (loss 2.8943): 44%|βββββ | 275/625 [03:42<05:54, 1.01s/it]
Training 1/1 epoch (loss 2.9009): 44%|βββββ | 275/625 [03:42<05:54, 1.01s/it]
Training 1/1 epoch (loss 2.9009): 44%|βββββ | 276/625 [03:42<05:14, 1.11it/s]
Training 1/1 epoch (loss 3.3333): 44%|βββββ | 276/625 [03:43<05:14, 1.11it/s]
Training 1/1 epoch (loss 3.3333): 44%|βββββ | 277/625 [03:43<04:37, 1.25it/s]
Training 1/1 epoch (loss 3.2020): 44%|βββββ | 277/625 [03:43<04:37, 1.25it/s]
Training 1/1 epoch (loss 3.2020): 44%|βββββ | 278/625 [03:43<04:15, 1.36it/s]
Training 1/1 epoch (loss 3.3284): 44%|βββββ | 278/625 [03:44<04:15, 1.36it/s]
Training 1/1 epoch (loss 3.3284): 45%|βββββ | 279/625 [03:44<03:47, 1.52it/s]
Training 1/1 epoch (loss 3.2154): 45%|βββββ | 279/625 [03:44<03:47, 1.52it/s]
Training 1/1 epoch (loss 3.2154): 45%|βββββ | 280/625 [03:44<03:38, 1.58it/s]
Training 1/1 epoch (loss 3.3455): 45%|βββββ | 280/625 [03:45<03:38, 1.58it/s]
Training 1/1 epoch (loss 3.3455): 45%|βββββ | 281/625 [03:45<04:18, 1.33it/s]
Training 1/1 epoch (loss 3.2256): 45%|βββββ | 281/625 [03:46<04:18, 1.33it/s]
Training 1/1 epoch (loss 3.2256): 45%|βββββ | 282/625 [03:46<04:34, 1.25it/s]
Training 1/1 epoch (loss 3.2300): 45%|βββββ | 282/625 [03:47<04:34, 1.25it/s]
Training 1/1 epoch (loss 3.2300): 45%|βββββ | 283/625 [03:47<04:43, 1.20it/s]
Training 1/1 epoch (loss 3.3518): 45%|βββββ | 283/625 [03:48<04:43, 1.20it/s]
Training 1/1 epoch (loss 3.3518): 45%|βββββ | 284/625 [03:48<04:47, 1.19it/s]
Training 1/1 epoch (loss 3.0899): 45%|βββββ | 284/625 [03:49<04:47, 1.19it/s]
Training 1/1 epoch (loss 3.0899): 46%|βββββ | 285/625 [03:49<04:34, 1.24it/s]
Training 1/1 epoch (loss 3.1900): 46%|βββββ | 285/625 [03:50<04:34, 1.24it/s]
Training 1/1 epoch (loss 3.1900): 46%|βββββ | 286/625 [03:50<04:29, 1.26it/s]
Training 1/1 epoch (loss 3.0958): 46%|βββββ | 286/625 [03:51<04:29, 1.26it/s]
Training 1/1 epoch (loss 3.0958): 46%|βββββ | 287/625 [03:51<04:37, 1.22it/s]
Training 1/1 epoch (loss 3.0679): 46%|βββββ | 287/625 [03:51<04:37, 1.22it/s]
Training 1/1 epoch (loss 3.0679): 46%|βββββ | 288/625 [03:51<04:06, 1.36it/s]
Training 1/1 epoch (loss 3.1870): 46%|βββββ | 288/625 [03:52<04:06, 1.36it/s]
Training 1/1 epoch (loss 3.1870): 46%|βββββ | 289/625 [03:52<04:35, 1.22it/s]
Training 1/1 epoch (loss 3.3747): 46%|βββββ | 289/625 [03:54<04:35, 1.22it/s]
Training 1/1 epoch (loss 3.3747): 46%|βββββ | 290/625 [03:54<05:36, 1.00s/it]
Training 1/1 epoch (loss 3.3142): 46%|βββββ | 290/625 [03:54<05:36, 1.00s/it]
Training 1/1 epoch (loss 3.3142): 47%|βββββ | 291/625 [03:54<04:41, 1.19it/s]
Training 1/1 epoch (loss 3.5432): 47%|βββββ | 291/625 [03:54<04:41, 1.19it/s]
Training 1/1 epoch (loss 3.5432): 47%|βββββ | 292/625 [03:54<04:02, 1.37it/s]
Training 1/1 epoch (loss 3.3009): 47%|βββββ | 292/625 [03:55<04:02, 1.37it/s]
Training 1/1 epoch (loss 3.3009): 47%|βββββ | 293/625 [03:55<04:22, 1.26it/s]
Training 1/1 epoch (loss 2.9061): 47%|βββββ | 293/625 [03:56<04:22, 1.26it/s]
Training 1/1 epoch (loss 2.9061): 47%|βββββ | 294/625 [03:56<04:36, 1.20it/s]
Training 1/1 epoch (loss 3.2268): 47%|βββββ | 294/625 [03:57<04:36, 1.20it/s]
Training 1/1 epoch (loss 3.2268): 47%|βββββ | 295/625 [03:57<04:19, 1.27it/s]
Training 1/1 epoch (loss 3.0651): 47%|βββββ | 295/625 [03:58<04:19, 1.27it/s]
Training 1/1 epoch (loss 3.0651): 47%|βββββ | 296/625 [03:58<05:05, 1.08it/s]
Training 1/1 epoch (loss 3.4198): 47%|βββββ | 296/625 [04:00<05:05, 1.08it/s]
Training 1/1 epoch (loss 3.4198): 48%|βββββ | 297/625 [04:00<05:52, 1.07s/it]
Training 1/1 epoch (loss 2.9623): 48%|βββββ | 297/625 [04:00<05:52, 1.07s/it]
Training 1/1 epoch (loss 2.9623): 48%|βββββ | 298/625 [04:00<04:55, 1.11it/s]
Training 1/1 epoch (loss 2.8023): 48%|βββββ | 298/625 [04:01<04:55, 1.11it/s]
Training 1/1 epoch (loss 2.8023): 48%|βββββ | 299/625 [04:01<04:46, 1.14it/s]
Training 1/1 epoch (loss 3.1175): 48%|βββββ | 299/625 [04:02<04:46, 1.14it/s]
Training 1/1 epoch (loss 3.1175): 48%|βββββ | 300/625 [04:02<04:44, 1.14it/s]
Training 1/1 epoch (loss 3.0765): 48%|βββββ | 300/625 [04:03<04:44, 1.14it/s]
Training 1/1 epoch (loss 3.0765): 48%|βββββ | 301/625 [04:03<04:43, 1.14it/s]
Training 1/1 epoch (loss 3.2109): 48%|βββββ | 301/625 [04:03<04:43, 1.14it/s]
Training 1/1 epoch (loss 3.2109): 48%|βββββ | 302/625 [04:03<03:51, 1.39it/s]
Training 1/1 epoch (loss 3.1052): 48%|βββββ | 302/625 [04:04<03:51, 1.39it/s]
Training 1/1 epoch (loss 3.1052): 48%|βββββ | 303/625 [04:04<03:49, 1.41it/s]
Training 1/1 epoch (loss 3.2653): 48%|βββββ | 303/625 [04:05<03:49, 1.41it/s]
Training 1/1 epoch (loss 3.2653): 49%|βββββ | 304/625 [04:05<04:42, 1.14it/s]
Training 1/1 epoch (loss 3.1698): 49%|βββββ | 304/625 [04:06<04:42, 1.14it/s]
Training 1/1 epoch (loss 3.1698): 49%|βββββ | 305/625 [04:06<04:46, 1.12it/s]
Training 1/1 epoch (loss 3.0104): 49%|βββββ | 305/625 [04:07<04:46, 1.12it/s]
Training 1/1 epoch (loss 3.0104): 49%|βββββ | 306/625 [04:07<04:15, 1.25it/s]
Training 1/1 epoch (loss 3.3095): 49%|βββββ | 306/625 [04:07<04:15, 1.25it/s]
Training 1/1 epoch (loss 3.3095): 49%|βββββ | 307/625 [04:07<04:27, 1.19it/s]
Training 1/1 epoch (loss 3.1918): 49%|βββββ | 307/625 [04:09<04:27, 1.19it/s]
Training 1/1 epoch (loss 3.1918): 49%|βββββ | 308/625 [04:09<05:08, 1.03it/s]
Training 1/1 epoch (loss 3.0995): 49%|βββββ | 308/625 [04:09<05:08, 1.03it/s]
Training 1/1 epoch (loss 3.0995): 49%|βββββ | 309/625 [04:09<04:25, 1.19it/s]
Training 1/1 epoch (loss 3.0218): 49%|βββββ | 309/625 [04:10<04:25, 1.19it/s]
Training 1/1 epoch (loss 3.0218): 50%|βββββ | 310/625 [04:10<04:18, 1.22it/s]
Training 1/1 epoch (loss 2.9653): 50%|βββββ | 310/625 [04:11<04:18, 1.22it/s]
Training 1/1 epoch (loss 2.9653): 50%|βββββ | 311/625 [04:11<04:30, 1.16it/s]
Training 1/1 epoch (loss 3.2209): 50%|βββββ | 311/625 [04:12<04:30, 1.16it/s]
Training 1/1 epoch (loss 3.2209): 50%|βββββ | 312/625 [04:12<04:54, 1.06it/s]
Training 1/1 epoch (loss 3.0173): 50%|βββββ | 312/625 [04:13<04:54, 1.06it/s]
Training 1/1 epoch (loss 3.0173): 50%|βββββ | 313/625 [04:13<04:13, 1.23it/s]
Training 1/1 epoch (loss 3.0470): 50%|βββββ | 313/625 [04:13<04:13, 1.23it/s]
Training 1/1 epoch (loss 3.0470): 50%|βββββ | 314/625 [04:13<04:11, 1.24it/s]
Training 1/1 epoch (loss 3.4776): 50%|βββββ | 314/625 [04:15<04:11, 1.24it/s]
Training 1/1 epoch (loss 3.4776): 50%|βββββ | 315/625 [04:15<04:45, 1.08it/s]
Training 1/1 epoch (loss 3.3164): 50%|βββββ | 315/625 [04:15<04:45, 1.08it/s]
Training 1/1 epoch (loss 3.3164): 51%|βββββ | 316/625 [04:15<03:56, 1.31it/s]
Training 1/1 epoch (loss 3.2731): 51%|βββββ | 316/625 [04:16<03:56, 1.31it/s]
Training 1/1 epoch (loss 3.2731): 51%|βββββ | 317/625 [04:16<03:51, 1.33it/s]
Training 1/1 epoch (loss 3.1743): 51%|βββββ | 317/625 [04:17<03:51, 1.33it/s]
Training 1/1 epoch (loss 3.1743): 51%|βββββ | 318/625 [04:17<04:05, 1.25it/s]
Training 1/1 epoch (loss 3.3121): 51%|βββββ | 318/625 [04:17<04:05, 1.25it/s]
Training 1/1 epoch (loss 3.3121): 51%|βββββ | 319/625 [04:17<03:59, 1.28it/s]
Training 1/1 epoch (loss 3.0561): 51%|βββββ | 319/625 [04:19<03:59, 1.28it/s]
Training 1/1 epoch (loss 3.0561): 51%|βββββ | 320/625 [04:19<05:16, 1.04s/it]
Training 1/1 epoch (loss 3.2501): 51%|βββββ | 320/625 [04:20<05:16, 1.04s/it]
Training 1/1 epoch (loss 3.2501): 51%|ββββββ | 321/625 [04:20<05:20, 1.06s/it]
Training 1/1 epoch (loss 3.3007): 51%|ββββββ | 321/625 [04:21<05:20, 1.06s/it]
Training 1/1 epoch (loss 3.3007): 52%|ββββββ | 322/625 [04:21<05:06, 1.01s/it]
Training 1/1 epoch (loss 3.2774): 52%|ββββββ | 322/625 [04:21<05:06, 1.01s/it]
Training 1/1 epoch (loss 3.2774): 52%|ββββββ | 323/625 [04:21<04:09, 1.21it/s]
Training 1/1 epoch (loss 3.3303): 52%|ββββββ | 323/625 [04:23<04:09, 1.21it/s]
Training 1/1 epoch (loss 3.3303): 52%|ββββββ | 324/625 [04:23<04:27, 1.12it/s]
Training 1/1 epoch (loss 3.3703): 52%|ββββββ | 324/625 [04:23<04:27, 1.12it/s]
Training 1/1 epoch (loss 3.3703): 52%|ββββββ | 325/625 [04:23<04:31, 1.10it/s]
Training 1/1 epoch (loss 3.0689): 52%|ββββββ | 325/625 [04:24<04:31, 1.10it/s]
Training 1/1 epoch (loss 3.0689): 52%|ββββββ | 326/625 [04:24<04:16, 1.17it/s]
Training 1/1 epoch (loss 3.1865): 52%|ββββββ | 326/625 [04:25<04:16, 1.17it/s]
Training 1/1 epoch (loss 3.1865): 52%|ββββββ | 327/625 [04:25<04:04, 1.22it/s]
Training 1/1 epoch (loss 3.0299): 52%|ββββββ | 327/625 [04:26<04:04, 1.22it/s]
Training 1/1 epoch (loss 3.0299): 52%|ββββββ | 328/625 [04:26<05:01, 1.01s/it]
Training 1/1 epoch (loss 3.1426): 52%|ββββββ | 328/625 [04:27<05:01, 1.01s/it]
Training 1/1 epoch (loss 3.1426): 53%|ββββββ | 329/625 [04:27<04:56, 1.00s/it]
Training 1/1 epoch (loss 3.1810): 53%|ββββββ | 329/625 [04:28<04:56, 1.00s/it]
Training 1/1 epoch (loss 3.1810): 53%|ββββββ | 330/625 [04:28<04:46, 1.03it/s]
Training 1/1 epoch (loss 3.2788): 53%|ββββββ | 330/625 [04:29<04:46, 1.03it/s]
Training 1/1 epoch (loss 3.2788): 53%|ββββββ | 331/625 [04:29<04:42, 1.04it/s]
Training 1/1 epoch (loss 3.3671): 53%|ββββββ | 331/625 [04:30<04:42, 1.04it/s]
Training 1/1 epoch (loss 3.3671): 53%|ββββββ | 332/625 [04:30<05:10, 1.06s/it]
Training 1/1 epoch (loss 3.0538): 53%|ββββββ | 332/625 [04:31<05:10, 1.06s/it]
Training 1/1 epoch (loss 3.0538): 53%|ββββββ | 333/625 [04:31<04:16, 1.14it/s]
Training 1/1 epoch (loss 3.1766): 53%|ββββββ | 333/625 [04:32<04:16, 1.14it/s]
Training 1/1 epoch (loss 3.1766): 53%|ββββββ | 334/625 [04:32<04:20, 1.12it/s]
Training 1/1 epoch (loss 3.0384): 53%|ββββββ | 334/625 [04:33<04:20, 1.12it/s]
Training 1/1 epoch (loss 3.0384): 54%|ββββββ | 335/625 [04:33<05:04, 1.05s/it]
Training 1/1 epoch (loss 3.2955): 54%|ββββββ | 335/625 [04:34<05:04, 1.05s/it]
Training 1/1 epoch (loss 3.2955): 54%|ββββββ | 336/625 [04:34<04:40, 1.03it/s]
Training 1/1 epoch (loss 3.3208): 54%|ββββββ | 336/625 [04:35<04:40, 1.03it/s]
Training 1/1 epoch (loss 3.3208): 54%|ββββββ | 337/625 [04:35<04:16, 1.12it/s]
Training 1/1 epoch (loss 3.3380): 54%|ββββββ | 337/625 [04:36<04:16, 1.12it/s]
Training 1/1 epoch (loss 3.3380): 54%|ββββββ | 338/625 [04:36<04:18, 1.11it/s]
Training 1/1 epoch (loss 2.9700): 54%|ββββββ | 338/625 [04:37<04:18, 1.11it/s]
Training 1/1 epoch (loss 2.9700): 54%|ββββββ | 339/625 [04:37<04:10, 1.14it/s]
Training 1/1 epoch (loss 3.2894): 54%|ββββββ | 339/625 [04:37<04:10, 1.14it/s]
Training 1/1 epoch (loss 3.2894): 54%|ββββββ | 340/625 [04:37<03:39, 1.30it/s]
Training 1/1 epoch (loss 3.1004): 54%|ββββββ | 340/625 [04:38<03:39, 1.30it/s]
Training 1/1 epoch (loss 3.1004): 55%|ββββββ | 341/625 [04:38<03:26, 1.38it/s]
Training 1/1 epoch (loss 3.2386): 55%|ββββββ | 341/625 [04:38<03:26, 1.38it/s]
Training 1/1 epoch (loss 3.2386): 55%|ββββββ | 342/625 [04:38<03:32, 1.33it/s]
Training 1/1 epoch (loss 2.9878): 55%|ββββββ | 342/625 [04:39<03:32, 1.33it/s]
Training 1/1 epoch (loss 2.9878): 55%|ββββββ | 343/625 [04:39<03:20, 1.40it/s]
Training 1/1 epoch (loss 3.0397): 55%|ββββββ | 343/625 [04:41<03:20, 1.40it/s]
Training 1/1 epoch (loss 3.0397): 55%|ββββββ | 344/625 [04:41<04:19, 1.08it/s]
Training 1/1 epoch (loss 3.1096): 55%|ββββββ | 344/625 [04:42<04:19, 1.08it/s]
Training 1/1 epoch (loss 3.1096): 55%|ββββββ | 345/625 [04:42<04:33, 1.02it/s]
Training 1/1 epoch (loss 3.0040): 55%|ββββββ | 345/625 [04:42<04:33, 1.02it/s]
Training 1/1 epoch (loss 3.0040): 55%|ββββββ | 346/625 [04:42<04:14, 1.10it/s]
Training 1/1 epoch (loss 3.2544): 55%|ββββββ | 346/625 [04:43<04:14, 1.10it/s]
Training 1/1 epoch (loss 3.2544): 56%|ββββββ | 347/625 [04:43<03:50, 1.20it/s]
Training 1/1 epoch (loss 3.5455): 56%|ββββββ | 347/625 [04:44<03:50, 1.20it/s]
Training 1/1 epoch (loss 3.5455): 56%|ββββββ | 348/625 [04:44<04:40, 1.01s/it]
Training 1/1 epoch (loss 3.1219): 56%|ββββββ | 348/625 [04:45<04:40, 1.01s/it]
Training 1/1 epoch (loss 3.1219): 56%|ββββββ | 349/625 [04:45<04:19, 1.06it/s]
Training 1/1 epoch (loss 3.2469): 56%|ββββββ | 349/625 [04:46<04:19, 1.06it/s]
Training 1/1 epoch (loss 3.2469): 56%|ββββββ | 350/625 [04:46<03:42, 1.23it/s]
Training 1/1 epoch (loss 3.0619): 56%|ββββββ | 350/625 [04:46<03:42, 1.23it/s]
Training 1/1 epoch (loss 3.0619): 56%|ββββββ | 351/625 [04:46<03:28, 1.31it/s]
Training 1/1 epoch (loss 3.5212): 56%|ββββββ | 351/625 [04:48<03:28, 1.31it/s]
Training 1/1 epoch (loss 3.5212): 56%|ββββββ | 352/625 [04:48<04:20, 1.05it/s]
Training 1/1 epoch (loss 3.7363): 56%|ββββββ | 352/625 [04:48<04:20, 1.05it/s]
Training 1/1 epoch (loss 3.7363): 56%|ββββββ | 353/625 [04:48<03:45, 1.20it/s]
Training 1/1 epoch (loss 3.1745): 56%|ββββββ | 353/625 [04:49<03:45, 1.20it/s]
Training 1/1 epoch (loss 3.1745): 57%|ββββββ | 354/625 [04:49<03:53, 1.16it/s]
Training 1/1 epoch (loss 3.2181): 57%|ββββββ | 354/625 [04:50<03:53, 1.16it/s]
Training 1/1 epoch (loss 3.2181): 57%|ββββββ | 355/625 [04:50<04:14, 1.06it/s]
Training 1/1 epoch (loss 3.1856): 57%|ββββββ | 355/625 [04:51<04:14, 1.06it/s]
Training 1/1 epoch (loss 3.1856): 57%|ββββββ | 356/625 [04:51<03:49, 1.17it/s]
Training 1/1 epoch (loss 3.3635): 57%|ββββββ | 356/625 [04:52<03:49, 1.17it/s]
Training 1/1 epoch (loss 3.3635): 57%|ββββββ | 357/625 [04:52<03:59, 1.12it/s]
Training 1/1 epoch (loss 3.1293): 57%|ββββββ | 357/625 [04:53<03:59, 1.12it/s]
Training 1/1 epoch (loss 3.1293): 57%|ββββββ | 358/625 [04:53<03:54, 1.14it/s]
Training 1/1 epoch (loss 3.3436): 57%|ββββββ | 358/625 [04:54<03:54, 1.14it/s]
Training 1/1 epoch (loss 3.3436): 57%|ββββββ | 359/625 [04:54<03:57, 1.12it/s]
Training 1/1 epoch (loss 3.0277): 57%|ββββββ | 359/625 [04:55<03:57, 1.12it/s]
Training 1/1 epoch (loss 3.0277): 58%|ββββββ | 360/625 [04:55<04:00, 1.10it/s]
Training 1/1 epoch (loss 3.2161): 58%|ββββββ | 360/625 [04:56<04:00, 1.10it/s]
Training 1/1 epoch (loss 3.2161): 58%|ββββββ | 361/625 [04:56<04:21, 1.01it/s]
Training 1/1 epoch (loss 3.1182): 58%|ββββββ | 361/625 [04:57<04:21, 1.01it/s]
Training 1/1 epoch (loss 3.1182): 58%|ββββββ | 362/625 [04:57<04:10, 1.05it/s]
Training 1/1 epoch (loss 3.2067): 58%|ββββββ | 362/625 [04:58<04:10, 1.05it/s]
Training 1/1 epoch (loss 3.2067): 58%|ββββββ | 363/625 [04:58<04:00, 1.09it/s]
Training 1/1 epoch (loss 3.1717): 58%|ββββββ | 363/625 [04:58<04:00, 1.09it/s]
Training 1/1 epoch (loss 3.1717): 58%|ββββββ | 364/625 [04:58<03:55, 1.11it/s]
Training 1/1 epoch (loss 2.9227): 58%|ββββββ | 364/625 [04:59<03:55, 1.11it/s]
Training 1/1 epoch (loss 2.9227): 58%|ββββββ | 365/625 [04:59<03:55, 1.10it/s]
Training 1/1 epoch (loss 3.1099): 58%|ββββββ | 365/625 [05:00<03:55, 1.10it/s]
Training 1/1 epoch (loss 3.1099): 59%|ββββββ | 366/625 [05:00<04:06, 1.05it/s]
Training 1/1 epoch (loss 3.1573): 59%|ββββββ | 366/625 [05:01<04:06, 1.05it/s]
Training 1/1 epoch (loss 3.1573): 59%|ββββββ | 367/625 [05:01<03:43, 1.15it/s]
Training 1/1 epoch (loss 3.0254): 59%|ββββββ | 367/625 [05:03<03:43, 1.15it/s]
Training 1/1 epoch (loss 3.0254): 59%|ββββββ | 368/625 [05:03<04:58, 1.16s/it]
Training 1/1 epoch (loss 3.0362): 59%|ββββββ | 368/625 [05:04<04:58, 1.16s/it]
Training 1/1 epoch (loss 3.0362): 59%|ββββββ | 369/625 [05:04<04:39, 1.09s/it]
Training 1/1 epoch (loss 3.1640): 59%|ββββββ | 369/625 [05:04<04:39, 1.09s/it]
Training 1/1 epoch (loss 3.1640): 59%|ββββββ | 370/625 [05:04<03:48, 1.12it/s]
Training 1/1 epoch (loss 3.3172): 59%|ββββββ | 370/625 [05:05<03:48, 1.12it/s]
Training 1/1 epoch (loss 3.3172): 59%|ββββββ | 371/625 [05:05<03:48, 1.11it/s]
Training 1/1 epoch (loss 3.3140): 59%|ββββββ | 371/625 [05:06<03:48, 1.11it/s]
Training 1/1 epoch (loss 3.3140): 60%|ββββββ | 372/625 [05:06<04:03, 1.04it/s]
Training 1/1 epoch (loss 3.1436): 60%|ββββββ | 372/625 [05:07<04:03, 1.04it/s]
Training 1/1 epoch (loss 3.1436): 60%|ββββββ | 373/625 [05:07<03:46, 1.11it/s]
Training 1/1 epoch (loss 3.2907): 60%|ββββββ | 373/625 [05:08<03:46, 1.11it/s]
Training 1/1 epoch (loss 3.2907): 60%|ββββββ | 374/625 [05:08<03:12, 1.31it/s]
Training 1/1 epoch (loss 3.1772): 60%|ββββββ | 374/625 [05:08<03:12, 1.31it/s]
Training 1/1 epoch (loss 3.1772): 60%|ββββββ | 375/625 [05:08<03:19, 1.26it/s]
Training 1/1 epoch (loss 3.1735): 60%|ββββββ | 375/625 [05:10<03:19, 1.26it/s]
Training 1/1 epoch (loss 3.1735): 60%|ββββββ | 376/625 [05:10<04:20, 1.05s/it]
Training 1/1 epoch (loss 3.0616): 60%|ββββββ | 376/625 [05:11<04:20, 1.05s/it]
Training 1/1 epoch (loss 3.0616): 60%|ββββββ | 377/625 [05:11<03:56, 1.05it/s]
Training 1/1 epoch (loss 3.0465): 60%|ββββββ | 377/625 [05:12<03:56, 1.05it/s]
Training 1/1 epoch (loss 3.0465): 60%|ββββββ | 378/625 [05:12<03:53, 1.06it/s]
Training 1/1 epoch (loss 3.3822): 60%|ββββββ | 378/625 [05:13<03:53, 1.06it/s]
Training 1/1 epoch (loss 3.3822): 61%|ββββββ | 379/625 [05:13<03:49, 1.07it/s]
Training 1/1 epoch (loss 2.9884): 61%|ββββββ | 379/625 [05:13<03:49, 1.07it/s]
Training 1/1 epoch (loss 2.9884): 61%|ββββββ | 380/625 [05:13<03:29, 1.17it/s]
Training 1/1 epoch (loss 3.3020): 61%|ββββββ | 380/625 [05:15<03:29, 1.17it/s]
Training 1/1 epoch (loss 3.3020): 61%|ββββββ | 381/625 [05:15<04:03, 1.00it/s]
Training 1/1 epoch (loss 3.0183): 61%|ββββββ | 381/625 [05:15<04:03, 1.00it/s]
Training 1/1 epoch (loss 3.0183): 61%|ββββββ | 382/625 [05:15<03:27, 1.17it/s]
Training 1/1 epoch (loss 3.3747): 61%|ββββββ | 382/625 [05:16<03:27, 1.17it/s]
Training 1/1 epoch (loss 3.3747): 61%|βββββββ | 383/625 [05:16<03:02, 1.33it/s]
Training 1/1 epoch (loss 3.0415): 61%|βββββββ | 383/625 [05:17<03:02, 1.33it/s]
Training 1/1 epoch (loss 3.0415): 61%|βββββββ | 384/625 [05:17<04:00, 1.00it/s]
Training 1/1 epoch (loss 3.0800): 61%|βββββββ | 384/625 [05:18<04:00, 1.00it/s]
Training 1/1 epoch (loss 3.0800): 62%|βββββββ | 385/625 [05:18<03:38, 1.10it/s]
Training 1/1 epoch (loss 3.3758): 62%|βββββββ | 385/625 [05:19<03:38, 1.10it/s]
Training 1/1 epoch (loss 3.3758): 62%|βββββββ | 386/625 [05:19<03:20, 1.19it/s]
Training 1/1 epoch (loss 3.2609): 62%|βββββββ | 386/625 [05:20<03:20, 1.19it/s]
Training 1/1 epoch (loss 3.2609): 62%|βββββββ | 387/625 [05:20<03:25, 1.16it/s]
Training 1/1 epoch (loss 3.2352): 62%|βββββββ | 387/625 [05:20<03:25, 1.16it/s]
Training 1/1 epoch (loss 3.2352): 62%|βββββββ | 388/625 [05:20<03:04, 1.28it/s]
Training 1/1 epoch (loss 2.9737): 62%|βββββββ | 388/625 [05:21<03:04, 1.28it/s]
Training 1/1 epoch (loss 2.9737): 62%|βββββββ | 389/625 [05:21<02:58, 1.32it/s]
Training 1/1 epoch (loss 3.2616): 62%|βββββββ | 389/625 [05:22<02:58, 1.32it/s]
Training 1/1 epoch (loss 3.2616): 62%|βββββββ | 390/625 [05:22<02:54, 1.34it/s]
Training 1/1 epoch (loss 3.1865): 62%|βββββββ | 390/625 [05:22<02:54, 1.34it/s]
Training 1/1 epoch (loss 3.1865): 63%|βββββββ | 391/625 [05:22<02:46, 1.40it/s]
Training 1/1 epoch (loss 3.3654): 63%|βββββββ | 391/625 [05:23<02:46, 1.40it/s]
Training 1/1 epoch (loss 3.3654): 63%|βββββββ | 392/625 [05:23<03:02, 1.28it/s]
Training 1/1 epoch (loss 3.1513): 63%|βββββββ | 392/625 [05:24<03:02, 1.28it/s]
Training 1/1 epoch (loss 3.1513): 63%|βββββββ | 393/625 [05:24<02:39, 1.45it/s]
Training 1/1 epoch (loss 3.1271): 63%|βββββββ | 393/625 [05:24<02:39, 1.45it/s]
Training 1/1 epoch (loss 3.1271): 63%|βββββββ | 394/625 [05:24<02:43, 1.42it/s]
Training 1/1 epoch (loss 3.2543): 63%|βββββββ | 394/625 [05:25<02:43, 1.42it/s]
Training 1/1 epoch (loss 3.2543): 63%|βββββββ | 395/625 [05:25<02:56, 1.30it/s]
Training 1/1 epoch (loss 3.0909): 63%|βββββββ | 395/625 [05:26<02:56, 1.30it/s]
Training 1/1 epoch (loss 3.0909): 63%|βββββββ | 396/625 [05:26<02:54, 1.31it/s]
Training 1/1 epoch (loss 3.3031): 63%|βββββββ | 396/625 [05:27<02:54, 1.31it/s]
Training 1/1 epoch (loss 3.3031): 64%|βββββββ | 397/625 [05:27<02:49, 1.35it/s]
Training 1/1 epoch (loss 3.1584): 64%|βββββββ | 397/625 [05:28<02:49, 1.35it/s]
Training 1/1 epoch (loss 3.1584): 64%|βββββββ | 398/625 [05:28<03:26, 1.10it/s]
Training 1/1 epoch (loss 3.1957): 64%|βββββββ | 398/625 [05:29<03:26, 1.10it/s]
Training 1/1 epoch (loss 3.1957): 64%|βββββββ | 399/625 [05:29<03:42, 1.02it/s]
Training 1/1 epoch (loss 3.0450): 64%|βββββββ | 399/625 [05:30<03:42, 1.02it/s]
Training 1/1 epoch (loss 3.0450): 64%|βββββββ | 400/625 [05:30<03:39, 1.02it/s]
Training 1/1 epoch (loss 3.1817): 64%|βββββββ | 400/625 [05:32<03:39, 1.02it/s]
Training 1/1 epoch (loss 3.1817): 64%|βββββββ | 401/625 [05:32<04:05, 1.10s/it]
Training 1/1 epoch (loss 3.3266): 64%|βββββββ | 401/625 [05:32<04:05, 1.10s/it]
Training 1/1 epoch (loss 3.3266): 64%|βββββββ | 402/625 [05:32<03:42, 1.00it/s]
Training 1/1 epoch (loss 3.3967): 64%|βββββββ | 402/625 [05:33<03:42, 1.00it/s]
Training 1/1 epoch (loss 3.3967): 64%|βββββββ | 403/625 [05:33<02:59, 1.24it/s]
Training 1/1 epoch (loss 3.3208): 64%|βββββββ | 403/625 [05:34<02:59, 1.24it/s]
Training 1/1 epoch (loss 3.3208): 65%|βββββββ | 404/625 [05:34<03:03, 1.20it/s]
Training 1/1 epoch (loss 3.1587): 65%|βββββββ | 404/625 [05:34<03:03, 1.20it/s]
Training 1/1 epoch (loss 3.1587): 65%|βββββββ | 405/625 [05:34<03:08, 1.17it/s]
Training 1/1 epoch (loss 3.1101): 65%|βββββββ | 405/625 [05:35<03:08, 1.17it/s]
Training 1/1 epoch (loss 3.1101): 65%|βββββββ | 406/625 [05:35<03:14, 1.13it/s]
Training 1/1 epoch (loss 3.2729): 65%|βββββββ | 406/625 [05:36<03:14, 1.13it/s]
Training 1/1 epoch (loss 3.2729): 65%|βββββββ | 407/625 [05:36<02:43, 1.34it/s]
Training 1/1 epoch (loss 3.1677): 65%|βββββββ | 407/625 [05:37<02:43, 1.34it/s]
Training 1/1 epoch (loss 3.1677): 65%|βββββββ | 408/625 [05:37<03:42, 1.03s/it]
Training 1/1 epoch (loss 3.3238): 65%|βββββββ | 408/625 [05:39<03:42, 1.03s/it]
Training 1/1 epoch (loss 3.3238): 65%|βββββββ | 409/625 [05:39<03:47, 1.05s/it]
Training 1/1 epoch (loss 3.4545): 65%|βββββββ | 409/625 [05:39<03:47, 1.05s/it]
Training 1/1 epoch (loss 3.4545): 66%|βββββββ | 410/625 [05:39<03:23, 1.06it/s]
Training 1/1 epoch (loss 2.9143): 66%|βββββββ | 410/625 [05:40<03:23, 1.06it/s]
Training 1/1 epoch (loss 2.9143): 66%|βββββββ | 411/625 [05:40<03:20, 1.07it/s]
Training 1/1 epoch (loss 3.1663): 66%|βββββββ | 411/625 [05:42<03:20, 1.07it/s]
Training 1/1 epoch (loss 3.1663): 66%|βββββββ | 412/625 [05:42<03:43, 1.05s/it]
Training 1/1 epoch (loss 3.2379): 66%|βββββββ | 412/625 [05:42<03:43, 1.05s/it]
Training 1/1 epoch (loss 3.2379): 66%|βββββββ | 413/625 [05:42<03:12, 1.10it/s]
Training 1/1 epoch (loss 3.4087): 66%|βββββββ | 413/625 [05:43<03:12, 1.10it/s]
Training 1/1 epoch (loss 3.4087): 66%|βββββββ | 414/625 [05:43<03:12, 1.09it/s]
Training 1/1 epoch (loss 3.0781): 66%|βββββββ | 414/625 [05:44<03:12, 1.09it/s]
Training 1/1 epoch (loss 3.0781): 66%|βββββββ | 415/625 [05:44<03:12, 1.09it/s]
Training 1/1 epoch (loss 2.9495): 66%|βββββββ | 415/625 [05:45<03:12, 1.09it/s]
Training 1/1 epoch (loss 2.9495): 67%|βββββββ | 416/625 [05:45<03:08, 1.11it/s]
Training 1/1 epoch (loss 3.2721): 67%|βββββββ | 416/625 [05:45<03:08, 1.11it/s]
Training 1/1 epoch (loss 3.2721): 67%|βββββββ | 417/625 [05:45<02:43, 1.27it/s]
Training 1/1 epoch (loss 3.2027): 67%|βββββββ | 417/625 [05:46<02:43, 1.27it/s]
Training 1/1 epoch (loss 3.2027): 67%|βββββββ | 418/625 [05:46<02:52, 1.20it/s]
Training 1/1 epoch (loss 3.3770): 67%|βββββββ | 418/625 [05:47<02:52, 1.20it/s]
Training 1/1 epoch (loss 3.3770): 67%|βββββββ | 419/625 [05:47<02:43, 1.26it/s]
Training 1/1 epoch (loss 3.3637): 67%|βββββββ | 419/625 [05:48<02:43, 1.26it/s]
Training 1/1 epoch (loss 3.3637): 67%|βββββββ | 420/625 [05:48<02:32, 1.35it/s]
Training 1/1 epoch (loss 2.9530): 67%|βββββββ | 420/625 [05:49<02:32, 1.35it/s]
Training 1/1 epoch (loss 2.9530): 67%|βββββββ | 421/625 [05:49<02:59, 1.14it/s]
Training 1/1 epoch (loss 3.1426): 67%|βββββββ | 421/625 [05:50<02:59, 1.14it/s]
Training 1/1 epoch (loss 3.1426): 68%|βββββββ | 422/625 [05:50<02:59, 1.13it/s]
Training 1/1 epoch (loss 3.2524): 68%|βββββββ | 422/625 [05:50<02:59, 1.13it/s]
Training 1/1 epoch (loss 3.2524): 68%|βββββββ | 423/625 [05:50<02:38, 1.28it/s]
Training 1/1 epoch (loss 3.3814): 68%|βββββββ | 423/625 [05:51<02:38, 1.28it/s]
Training 1/1 epoch (loss 3.3814): 68%|βββββββ | 424/625 [05:51<02:51, 1.17it/s]
Training 1/1 epoch (loss 3.3183): 68%|βββββββ | 424/625 [05:52<02:51, 1.17it/s]
Training 1/1 epoch (loss 3.3183): 68%|βββββββ | 425/625 [05:52<02:48, 1.19it/s]
Training 1/1 epoch (loss 3.2185): 68%|βββββββ | 425/625 [05:53<02:48, 1.19it/s]
Training 1/1 epoch (loss 3.2185): 68%|βββββββ | 426/625 [05:53<02:46, 1.20it/s]
Training 1/1 epoch (loss 3.0941): 68%|βββββββ | 426/625 [05:53<02:46, 1.20it/s]
Training 1/1 epoch (loss 3.0941): 68%|βββββββ | 427/625 [05:53<02:24, 1.37it/s]
Training 1/1 epoch (loss 3.0582): 68%|βββββββ | 427/625 [05:54<02:24, 1.37it/s]
Training 1/1 epoch (loss 3.0582): 68%|βββββββ | 428/625 [05:54<02:34, 1.27it/s]
Training 1/1 epoch (loss 3.1872): 68%|βββββββ | 428/625 [05:55<02:34, 1.27it/s]
Training 1/1 epoch (loss 3.1872): 69%|βββββββ | 429/625 [05:55<02:37, 1.25it/s]
Training 1/1 epoch (loss 3.5236): 69%|βββββββ | 429/625 [05:56<02:37, 1.25it/s]
Training 1/1 epoch (loss 3.5236): 69%|βββββββ | 430/625 [05:56<02:34, 1.27it/s]
Training 1/1 epoch (loss 3.1969): 69%|βββββββ | 430/625 [05:56<02:34, 1.27it/s]
Training 1/1 epoch (loss 3.1969): 69%|βββββββ | 431/625 [05:56<02:13, 1.45it/s]
Training 1/1 epoch (loss 2.8341): 69%|βββββββ | 431/625 [05:58<02:13, 1.45it/s]
Training 1/1 epoch (loss 2.8341): 69%|βββββββ | 432/625 [05:58<02:54, 1.11it/s]
Training 1/1 epoch (loss 2.8407): 69%|βββββββ | 432/625 [05:59<02:54, 1.11it/s]
Training 1/1 epoch (loss 2.8407): 69%|βββββββ | 433/625 [05:59<03:03, 1.05it/s]
Training 1/1 epoch (loss 3.1362): 69%|βββββββ | 433/625 [05:59<03:03, 1.05it/s]
Training 1/1 epoch (loss 3.1362): 69%|βββββββ | 434/625 [05:59<02:37, 1.22it/s]
Training 1/1 epoch (loss 3.4080): 69%|βββββββ | 434/625 [06:00<02:37, 1.22it/s]
Training 1/1 epoch (loss 3.4080): 70%|βββββββ | 435/625 [06:00<02:29, 1.27it/s]
Training 1/1 epoch (loss 3.1540): 70%|βββββββ | 435/625 [06:01<02:29, 1.27it/s]
Training 1/1 epoch (loss 3.1540): 70%|βββββββ | 436/625 [06:01<02:33, 1.23it/s]
Training 1/1 epoch (loss 3.0995): 70%|βββββββ | 436/625 [06:01<02:33, 1.23it/s]
Training 1/1 epoch (loss 3.0995): 70%|βββββββ | 437/625 [06:01<02:11, 1.42it/s]
Training 1/1 epoch (loss 3.5420): 70%|βββββββ | 437/625 [06:02<02:11, 1.42it/s]
Training 1/1 epoch (loss 3.5420): 70%|βββββββ | 438/625 [06:02<01:58, 1.58it/s]
Training 1/1 epoch (loss 3.2643): 70%|βββββββ | 438/625 [06:03<01:58, 1.58it/s]
Training 1/1 epoch (loss 3.2643): 70%|βββββββ | 439/625 [06:03<02:13, 1.39it/s]
Training 1/1 epoch (loss 3.2799): 70%|βββββββ | 439/625 [06:04<02:13, 1.39it/s]
Training 1/1 epoch (loss 3.2799): 70%|βββββββ | 440/625 [06:04<02:54, 1.06it/s]
Training 1/1 epoch (loss 3.2053): 70%|βββββββ | 440/625 [06:05<02:54, 1.06it/s]
Training 1/1 epoch (loss 3.2053): 71%|βββββββ | 441/625 [06:05<02:29, 1.23it/s]
Training 1/1 epoch (loss 3.3308): 71%|βββββββ | 441/625 [06:06<02:29, 1.23it/s]
Training 1/1 epoch (loss 3.3308): 71%|βββββββ | 442/625 [06:06<02:30, 1.21it/s]
Training 1/1 epoch (loss 3.1951): 71%|βββββββ | 442/625 [06:06<02:30, 1.21it/s]
Training 1/1 epoch (loss 3.1951): 71%|βββββββ | 443/625 [06:06<02:18, 1.32it/s]
Training 1/1 epoch (loss 3.1754): 71%|βββββββ | 443/625 [06:07<02:18, 1.32it/s]
Training 1/1 epoch (loss 3.1754): 71%|βββββββ | 444/625 [06:07<02:22, 1.27it/s]
Training 1/1 epoch (loss 3.1591): 71%|βββββββ | 444/625 [06:08<02:22, 1.27it/s]
Training 1/1 epoch (loss 3.1591): 71%|βββββββ | 445/625 [06:08<02:15, 1.33it/s]
Training 1/1 epoch (loss 3.3175): 71%|βββββββ | 445/625 [06:08<02:15, 1.33it/s]
Training 1/1 epoch (loss 3.3175): 71%|ββββββββ | 446/625 [06:08<02:09, 1.38it/s]
Training 1/1 epoch (loss 3.4559): 71%|ββββββββ | 446/625 [06:09<02:09, 1.38it/s]
Training 1/1 epoch (loss 3.4559): 72%|ββββββββ | 447/625 [06:09<02:19, 1.28it/s]
Training 1/1 epoch (loss 3.1433): 72%|ββββββββ | 447/625 [06:10<02:19, 1.28it/s]
Training 1/1 epoch (loss 3.1433): 72%|ββββββββ | 448/625 [06:10<02:01, 1.46it/s]
Training 1/1 epoch (loss 3.3883): 72%|ββββββββ | 448/625 [06:11<02:01, 1.46it/s]
Training 1/1 epoch (loss 3.3883): 72%|ββββββββ | 449/625 [06:11<02:22, 1.23it/s]
Training 1/1 epoch (loss 3.1418): 72%|ββββββββ | 449/625 [06:12<02:22, 1.23it/s]
Training 1/1 epoch (loss 3.1418): 72%|ββββββββ | 450/625 [06:12<02:26, 1.20it/s]
Training 1/1 epoch (loss 2.8694): 72%|ββββββββ | 450/625 [06:12<02:26, 1.20it/s]
Training 1/1 epoch (loss 2.8694): 72%|ββββββββ | 451/625 [06:12<02:04, 1.39it/s]
Training 1/1 epoch (loss 3.0577): 72%|ββββββββ | 451/625 [06:13<02:04, 1.39it/s]
Training 1/1 epoch (loss 3.0577): 72%|ββββββββ | 452/625 [06:13<02:13, 1.29it/s]
Training 1/1 epoch (loss 3.0372): 72%|ββββββββ | 452/625 [06:14<02:13, 1.29it/s]
Training 1/1 epoch (loss 3.0372): 72%|ββββββββ | 453/625 [06:14<02:28, 1.16it/s]
Training 1/1 epoch (loss 2.9794): 72%|ββββββββ | 453/625 [06:15<02:28, 1.16it/s]
Training 1/1 epoch (loss 2.9794): 73%|ββββββββ | 454/625 [06:15<02:39, 1.07it/s]
Training 1/1 epoch (loss 3.2196): 73%|ββββββββ | 454/625 [06:16<02:39, 1.07it/s]
Training 1/1 epoch (loss 3.2196): 73%|ββββββββ | 455/625 [06:16<02:09, 1.31it/s]
Training 1/1 epoch (loss 3.2256): 73%|ββββββββ | 455/625 [06:17<02:09, 1.31it/s]
Training 1/1 epoch (loss 3.2256): 73%|ββββββββ | 456/625 [06:17<02:38, 1.06it/s]
Training 1/1 epoch (loss 3.3645): 73%|ββββββββ | 456/625 [06:18<02:38, 1.06it/s]
Training 1/1 epoch (loss 3.3645): 73%|ββββββββ | 457/625 [06:18<02:43, 1.03it/s]
Training 1/1 epoch (loss 3.2947): 73%|ββββββββ | 457/625 [06:19<02:43, 1.03it/s]
Training 1/1 epoch (loss 3.2947): 73%|ββββββββ | 458/625 [06:19<02:21, 1.18it/s]
Training 1/1 epoch (loss 3.1665): 73%|ββββββββ | 458/625 [06:19<02:21, 1.18it/s]
Training 1/1 epoch (loss 3.1665): 73%|ββββββββ | 459/625 [06:19<02:23, 1.16it/s]
Training 1/1 epoch (loss 2.8900): 73%|ββββββββ | 459/625 [06:20<02:23, 1.16it/s]
Training 1/1 epoch (loss 2.8900): 74%|ββββββββ | 460/625 [06:20<02:25, 1.14it/s]
Training 1/1 epoch (loss 3.4469): 74%|ββββββββ | 460/625 [06:21<02:25, 1.14it/s]
Training 1/1 epoch (loss 3.4469): 74%|ββββββββ | 461/625 [06:21<02:26, 1.12it/s]
Training 1/1 epoch (loss 3.3325): 74%|ββββββββ | 461/625 [06:22<02:26, 1.12it/s]
Training 1/1 epoch (loss 3.3325): 74%|ββββββββ | 462/625 [06:22<02:00, 1.35it/s]
Training 1/1 epoch (loss 3.2134): 74%|ββββββββ | 462/625 [06:23<02:00, 1.35it/s]
Training 1/1 epoch (loss 3.2134): 74%|ββββββββ | 463/625 [06:23<02:32, 1.06it/s]
Training 1/1 epoch (loss 3.3385): 74%|ββββββββ | 463/625 [06:24<02:32, 1.06it/s]
Training 1/1 epoch (loss 3.3385): 74%|ββββββββ | 464/625 [06:24<02:19, 1.16it/s]
Training 1/1 epoch (loss 3.2775): 74%|ββββββββ | 464/625 [06:25<02:19, 1.16it/s]
Training 1/1 epoch (loss 3.2775): 74%|ββββββββ | 465/625 [06:25<02:13, 1.20it/s]
Training 1/1 epoch (loss 3.3672): 74%|ββββββββ | 465/625 [06:25<02:13, 1.20it/s]
Training 1/1 epoch (loss 3.3672): 75%|ββββββββ | 466/625 [06:25<02:09, 1.23it/s]
Training 1/1 epoch (loss 3.0527): 75%|ββββββββ | 466/625 [06:26<02:09, 1.23it/s]
Training 1/1 epoch (loss 3.0527): 75%|ββββββββ | 467/625 [06:26<02:11, 1.21it/s]
Training 1/1 epoch (loss 3.2346): 75%|ββββββββ | 467/625 [06:27<02:11, 1.21it/s]
Training 1/1 epoch (loss 3.2346): 75%|ββββββββ | 468/625 [06:27<02:10, 1.20it/s]
Training 1/1 epoch (loss 3.3978): 75%|ββββββββ | 468/625 [06:28<02:10, 1.20it/s]
Training 1/1 epoch (loss 3.3978): 75%|ββββββββ | 469/625 [06:28<01:52, 1.39it/s]
Training 1/1 epoch (loss 3.2128): 75%|ββββββββ | 469/625 [06:29<01:52, 1.39it/s]
Training 1/1 epoch (loss 3.2128): 75%|ββββββββ | 470/625 [06:29<02:23, 1.08it/s]
Training 1/1 epoch (loss 3.1324): 75%|ββββββββ | 470/625 [06:30<02:23, 1.08it/s]
Training 1/1 epoch (loss 3.1324): 75%|ββββββββ | 471/625 [06:30<02:21, 1.08it/s]
Training 1/1 epoch (loss 3.3712): 75%|ββββββββ | 471/625 [06:31<02:21, 1.08it/s]
Training 1/1 epoch (loss 3.3712): 76%|ββββββββ | 472/625 [06:31<02:17, 1.12it/s]
Training 1/1 epoch (loss 2.8696): 76%|ββββββββ | 472/625 [06:31<02:17, 1.12it/s]
Training 1/1 epoch (loss 2.8696): 76%|ββββββββ | 473/625 [06:31<01:59, 1.27it/s]
Training 1/1 epoch (loss 3.3069): 76%|ββββββββ | 473/625 [06:32<01:59, 1.27it/s]
Training 1/1 epoch (loss 3.3069): 76%|ββββββββ | 474/625 [06:32<01:56, 1.30it/s]
Training 1/1 epoch (loss 3.4110): 76%|ββββββββ | 474/625 [06:33<01:56, 1.30it/s]
Training 1/1 epoch (loss 3.4110): 76%|ββββββββ | 475/625 [06:33<02:08, 1.16it/s]
Training 1/1 epoch (loss 3.4697): 76%|ββββββββ | 475/625 [06:34<02:08, 1.16it/s]
Training 1/1 epoch (loss 3.4697): 76%|ββββββββ | 476/625 [06:34<01:56, 1.28it/s]
Training 1/1 epoch (loss 3.1552): 76%|ββββββββ | 476/625 [06:34<01:56, 1.28it/s]
Training 1/1 epoch (loss 3.1552): 76%|ββββββββ | 477/625 [06:34<01:58, 1.25it/s]
Training 1/1 epoch (loss 3.1868): 76%|ββββββββ | 477/625 [06:35<01:58, 1.25it/s]
Training 1/1 epoch (loss 3.1868): 76%|ββββββββ | 478/625 [06:35<02:01, 1.21it/s]
Training 1/1 epoch (loss 3.1318): 76%|ββββββββ | 478/625 [06:36<02:01, 1.21it/s]
Training 1/1 epoch (loss 3.1318): 77%|ββββββββ | 479/625 [06:36<01:57, 1.25it/s]
Training 1/1 epoch (loss 3.2493): 77%|ββββββββ | 479/625 [06:37<01:57, 1.25it/s]
Training 1/1 epoch (loss 3.2493): 77%|ββββββββ | 480/625 [06:37<02:19, 1.04it/s]
Training 1/1 epoch (loss 3.2316): 77%|ββββββββ | 480/625 [06:39<02:19, 1.04it/s]
Training 1/1 epoch (loss 3.2316): 77%|ββββββββ | 481/625 [06:39<03:00, 1.25s/it]
Training 1/1 epoch (loss 2.9015): 77%|ββββββββ | 481/625 [06:40<03:00, 1.25s/it]
Training 1/1 epoch (loss 2.9015): 77%|ββββββββ | 482/625 [06:40<02:31, 1.06s/it]
Training 1/1 epoch (loss 3.0486): 77%|ββββββββ | 482/625 [06:40<02:31, 1.06s/it]
Training 1/1 epoch (loss 3.0486): 77%|ββββββββ | 483/625 [06:40<02:03, 1.15it/s]
Training 1/1 epoch (loss 3.2202): 77%|ββββββββ | 483/625 [06:41<02:03, 1.15it/s]
Training 1/1 epoch (loss 3.2202): 77%|ββββββββ | 484/625 [06:41<02:10, 1.08it/s]
Training 1/1 epoch (loss 3.1825): 77%|ββββββββ | 484/625 [06:42<02:10, 1.08it/s]
Training 1/1 epoch (loss 3.1825): 78%|ββββββββ | 485/625 [06:42<01:57, 1.19it/s]
Training 1/1 epoch (loss 3.1241): 78%|ββββββββ | 485/625 [06:43<01:57, 1.19it/s]
Training 1/1 epoch (loss 3.1241): 78%|ββββββββ | 486/625 [06:43<01:56, 1.19it/s]
Training 1/1 epoch (loss 3.2831): 78%|ββββββββ | 486/625 [06:44<01:56, 1.19it/s]
Training 1/1 epoch (loss 3.2831): 78%|ββββββββ | 487/625 [06:44<01:59, 1.16it/s]
Training 1/1 epoch (loss 3.0268): 78%|ββββββββ | 487/625 [06:45<01:59, 1.16it/s]
Training 1/1 epoch (loss 3.0268): 78%|ββββββββ | 488/625 [06:45<02:26, 1.07s/it]
Training 1/1 epoch (loss 3.2565): 78%|ββββββββ | 488/625 [06:46<02:26, 1.07s/it]
Training 1/1 epoch (loss 3.2565): 78%|ββββββββ | 489/625 [06:46<02:00, 1.12it/s]
Training 1/1 epoch (loss 3.2277): 78%|ββββββββ | 489/625 [06:47<02:00, 1.12it/s]
Training 1/1 epoch (loss 3.2277): 78%|ββββββββ | 490/625 [06:47<01:55, 1.17it/s]
Training 1/1 epoch (loss 3.2187): 78%|ββββββββ | 490/625 [06:48<01:55, 1.17it/s]
Training 1/1 epoch (loss 3.2187): 79%|ββββββββ | 491/625 [06:48<01:56, 1.15it/s]
Training 1/1 epoch (loss 3.0052): 79%|ββββββββ | 491/625 [06:49<01:56, 1.15it/s]
Training 1/1 epoch (loss 3.0052): 79%|ββββββββ | 492/625 [06:49<02:08, 1.04it/s]
Training 1/1 epoch (loss 3.0381): 79%|ββββββββ | 492/625 [06:49<02:08, 1.04it/s]
Training 1/1 epoch (loss 3.0381): 79%|ββββββββ | 493/625 [06:49<01:56, 1.13it/s]
Training 1/1 epoch (loss 2.9458): 79%|ββββββββ | 493/625 [06:50<01:56, 1.13it/s]
Training 1/1 epoch (loss 2.9458): 79%|ββββββββ | 494/625 [06:50<01:54, 1.14it/s]
Training 1/1 epoch (loss 2.9459): 79%|ββββββββ | 494/625 [06:51<01:54, 1.14it/s]
Training 1/1 epoch (loss 2.9459): 79%|ββββββββ | 495/625 [06:51<01:49, 1.18it/s]
Training 1/1 epoch (loss 3.1441): 79%|ββββββββ | 495/625 [06:51<01:49, 1.18it/s]
Training 1/1 epoch (loss 3.1441): 79%|ββββββββ | 496/625 [06:51<01:32, 1.39it/s]
Training 1/1 epoch (loss 3.2770): 79%|ββββββββ | 496/625 [06:53<01:32, 1.39it/s]
Training 1/1 epoch (loss 3.2770): 80%|ββββββββ | 497/625 [06:53<01:46, 1.21it/s]
Training 1/1 epoch (loss 3.0614): 80%|ββββββββ | 497/625 [06:53<01:46, 1.21it/s]
Training 1/1 epoch (loss 3.0614): 80%|ββββββββ | 498/625 [06:53<01:48, 1.17it/s]
Training 1/1 epoch (loss 3.2860): 80%|ββββββββ | 498/625 [06:54<01:48, 1.17it/s]
Training 1/1 epoch (loss 3.2860): 80%|ββββββββ | 499/625 [06:54<01:38, 1.27it/s]
Training 1/1 epoch (loss 2.9683): 80%|ββββββββ | 499/625 [06:55<01:38, 1.27it/s]
Training 1/1 epoch (loss 2.9683): 80%|ββββββββ | 500/625 [06:55<01:30, 1.38it/s]
Training 1/1 epoch (loss 3.1558): 80%|ββββββββ | 500/625 [06:56<01:30, 1.38it/s]
Training 1/1 epoch (loss 3.1558): 80%|ββββββββ | 501/625 [06:56<01:35, 1.29it/s]
Training 1/1 epoch (loss 3.2447): 80%|ββββββββ | 501/625 [06:56<01:35, 1.29it/s]
Training 1/1 epoch (loss 3.2447): 80%|ββββββββ | 502/625 [06:56<01:41, 1.21it/s]
Training 1/1 epoch (loss 3.1715): 80%|ββββββββ | 502/625 [06:57<01:41, 1.21it/s]
Training 1/1 epoch (loss 3.1715): 80%|ββββββββ | 503/625 [06:57<01:25, 1.43it/s]
Training 1/1 epoch (loss 3.3321): 80%|ββββββββ | 503/625 [06:58<01:25, 1.43it/s]
Training 1/1 epoch (loss 3.3321): 81%|ββββββββ | 504/625 [06:58<01:51, 1.08it/s]
Training 1/1 epoch (loss 3.1904): 81%|ββββββββ | 504/625 [06:59<01:51, 1.08it/s]
Training 1/1 epoch (loss 3.1904): 81%|ββββββββ | 505/625 [06:59<01:57, 1.02it/s]
Training 1/1 epoch (loss 3.3381): 81%|ββββββββ | 505/625 [07:00<01:57, 1.02it/s]
Training 1/1 epoch (loss 3.3381): 81%|ββββββββ | 506/625 [07:00<01:38, 1.21it/s]
Training 1/1 epoch (loss 3.0770): 81%|ββββββββ | 506/625 [07:01<01:38, 1.21it/s]
Training 1/1 epoch (loss 3.0770): 81%|ββββββββ | 507/625 [07:01<01:36, 1.23it/s]
Training 1/1 epoch (loss 3.0273): 81%|ββββββββ | 507/625 [07:02<01:36, 1.23it/s]
Training 1/1 epoch (loss 3.0273): 81%|βββββββββ | 508/625 [07:02<01:38, 1.19it/s]
Training 1/1 epoch (loss 3.1281): 81%|βββββββββ | 508/625 [07:02<01:38, 1.19it/s]
Training 1/1 epoch (loss 3.1281): 81%|βββββββββ | 509/625 [07:02<01:34, 1.22it/s]
Training 1/1 epoch (loss 3.0986): 81%|βββββββββ | 509/625 [07:03<01:34, 1.22it/s]
Training 1/1 epoch (loss 3.0986): 82%|βββββββββ | 510/625 [07:03<01:21, 1.40it/s]
Training 1/1 epoch (loss 3.4157): 82%|βββββββββ | 510/625 [07:04<01:21, 1.40it/s]
Training 1/1 epoch (loss 3.4157): 82%|βββββββββ | 511/625 [07:04<01:21, 1.40it/s]
Training 1/1 epoch (loss 2.8411): 82%|βββββββββ | 511/625 [07:05<01:21, 1.40it/s]
Training 1/1 epoch (loss 2.8411): 82%|βββββββββ | 512/625 [07:05<01:59, 1.06s/it]
Training 1/1 epoch (loss 2.9672): 82%|βββββββββ | 512/625 [07:06<01:59, 1.06s/it]
Training 1/1 epoch (loss 2.9672): 82%|βββββββββ | 513/625 [07:06<01:37, 1.15it/s]
Training 1/1 epoch (loss 3.1977): 82%|βββββββββ | 513/625 [07:07<01:37, 1.15it/s]
Training 1/1 epoch (loss 3.1977): 82%|βββββββββ | 514/625 [07:07<01:38, 1.13it/s]
Training 1/1 epoch (loss 3.1112): 82%|βββββββββ | 514/625 [07:08<01:38, 1.13it/s]
Training 1/1 epoch (loss 3.1112): 82%|βββββββββ | 515/625 [07:08<01:51, 1.01s/it]
Training 1/1 epoch (loss 3.3369): 82%|βββββββββ | 515/625 [07:09<01:51, 1.01s/it]
Training 1/1 epoch (loss 3.3369): 83%|βββββββββ | 516/625 [07:09<01:36, 1.13it/s]
Training 1/1 epoch (loss 3.1110): 83%|βββββββββ | 516/625 [07:10<01:36, 1.13it/s]
Training 1/1 epoch (loss 3.1110): 83%|βββββββββ | 517/625 [07:10<01:35, 1.14it/s]
Training 1/1 epoch (loss 3.3455): 83%|βββββββββ | 517/625 [07:10<01:35, 1.14it/s]
Training 1/1 epoch (loss 3.3455): 83%|βββββββββ | 518/625 [07:10<01:34, 1.14it/s]
Training 1/1 epoch (loss 3.4122): 83%|βββββββββ | 518/625 [07:11<01:34, 1.14it/s]
Training 1/1 epoch (loss 3.4122): 83%|βββββββββ | 519/625 [07:11<01:39, 1.07it/s]
Training 1/1 epoch (loss 3.3470): 83%|βββββββββ | 519/625 [07:13<01:39, 1.07it/s]
Training 1/1 epoch (loss 3.3470): 83%|βββββββββ | 520/625 [07:13<01:45, 1.01s/it]
Training 1/1 epoch (loss 3.3413): 83%|βββββββββ | 520/625 [07:14<01:45, 1.01s/it]
Training 1/1 epoch (loss 3.3413): 83%|βββββββββ | 521/625 [07:14<01:47, 1.03s/it]
Training 1/1 epoch (loss 3.1326): 83%|βββββββββ | 521/625 [07:15<01:47, 1.03s/it]
Training 1/1 epoch (loss 3.1326): 84%|βββββββββ | 522/625 [07:15<01:44, 1.02s/it]
Training 1/1 epoch (loss 3.1687): 84%|βββββββββ | 522/625 [07:15<01:44, 1.02s/it]
Training 1/1 epoch (loss 3.1687): 84%|βββββββββ | 523/625 [07:15<01:28, 1.15it/s]
Training 1/1 epoch (loss 3.1254): 84%|βββββββββ | 523/625 [07:16<01:28, 1.15it/s]
Training 1/1 epoch (loss 3.1254): 84%|βββββββββ | 524/625 [07:16<01:28, 1.14it/s]
Training 1/1 epoch (loss 3.2288): 84%|βββββββββ | 524/625 [07:17<01:28, 1.14it/s]
Training 1/1 epoch (loss 3.2288): 84%|βββββββββ | 525/625 [07:17<01:28, 1.13it/s]
Training 1/1 epoch (loss 3.4430): 84%|βββββββββ | 525/625 [07:18<01:28, 1.13it/s]
Training 1/1 epoch (loss 3.4430): 84%|βββββββββ | 526/625 [07:18<01:26, 1.15it/s]
Training 1/1 epoch (loss 3.3004): 84%|βββββββββ | 526/625 [07:18<01:26, 1.15it/s]
Training 1/1 epoch (loss 3.3004): 84%|βββββββββ | 527/625 [07:18<01:14, 1.32it/s]
Training 1/1 epoch (loss 3.1164): 84%|βββββββββ | 527/625 [07:20<01:14, 1.32it/s]
Training 1/1 epoch (loss 3.1164): 84%|βββββββββ | 528/625 [07:20<01:32, 1.05it/s]
Training 1/1 epoch (loss 3.1060): 84%|βββββββββ | 528/625 [07:21<01:32, 1.05it/s]
Training 1/1 epoch (loss 3.1060): 85%|βββββββββ | 529/625 [07:21<01:34, 1.02it/s]
Training 1/1 epoch (loss 3.0069): 85%|βββββββββ | 529/625 [07:21<01:34, 1.02it/s]
Training 1/1 epoch (loss 3.0069): 85%|βββββββββ | 530/625 [07:21<01:21, 1.17it/s]
Training 1/1 epoch (loss 3.3327): 85%|βββββββββ | 530/625 [07:23<01:21, 1.17it/s]
Training 1/1 epoch (loss 3.3327): 85%|βββββββββ | 531/625 [07:23<01:33, 1.01it/s]
Training 1/1 epoch (loss 3.1732): 85%|βββββββββ | 531/625 [07:24<01:33, 1.01it/s]
Training 1/1 epoch (loss 3.1732): 85%|βββββββββ | 532/625 [07:24<01:30, 1.03it/s]
Training 1/1 epoch (loss 3.3146): 85%|βββββββββ | 532/625 [07:24<01:30, 1.03it/s]
Training 1/1 epoch (loss 3.3146): 85%|βββββββββ | 533/625 [07:24<01:15, 1.21it/s]
Training 1/1 epoch (loss 3.4489): 85%|βββββββββ | 533/625 [07:25<01:15, 1.21it/s]
Training 1/1 epoch (loss 3.4489): 85%|βββββββββ | 534/625 [07:25<01:21, 1.11it/s]
Training 1/1 epoch (loss 3.0108): 85%|βββββββββ | 534/625 [07:26<01:21, 1.11it/s]
Training 1/1 epoch (loss 3.0108): 86%|βββββββββ | 535/625 [07:26<01:26, 1.04it/s]
Training 1/1 epoch (loss 3.2476): 86%|βββββββββ | 535/625 [07:27<01:26, 1.04it/s]
Training 1/1 epoch (loss 3.2476): 86%|βββββββββ | 536/625 [07:27<01:20, 1.11it/s]
Training 1/1 epoch (loss 3.3768): 86%|βββββββββ | 536/625 [07:28<01:20, 1.11it/s]
Training 1/1 epoch (loss 3.3768): 86%|βββββββββ | 537/625 [07:28<01:15, 1.16it/s]
Training 1/1 epoch (loss 3.0010): 86%|βββββββββ | 537/625 [07:29<01:15, 1.16it/s]
Training 1/1 epoch (loss 3.0010): 86%|βββββββββ | 538/625 [07:29<01:31, 1.05s/it]
Training 1/1 epoch (loss 3.0869): 86%|βββββββββ | 538/625 [07:30<01:31, 1.05s/it]
Training 1/1 epoch (loss 3.0869): 86%|βββββββββ | 539/625 [07:30<01:22, 1.04it/s]
Training 1/1 epoch (loss 3.0067): 86%|βββββββββ | 539/625 [07:31<01:22, 1.04it/s]
Training 1/1 epoch (loss 3.0067): 86%|βββββββββ | 540/625 [07:31<01:14, 1.14it/s]
Training 1/1 epoch (loss 2.9685): 86%|βββββββββ | 540/625 [07:32<01:14, 1.14it/s]
Training 1/1 epoch (loss 2.9685): 87%|βββββββββ | 541/625 [07:32<01:14, 1.13it/s]
Training 1/1 epoch (loss 3.1919): 87%|βββββββββ | 541/625 [07:33<01:14, 1.13it/s]
Training 1/1 epoch (loss 3.1919): 87%|βββββββββ | 542/625 [07:33<01:16, 1.08it/s]
Training 1/1 epoch (loss 3.1725): 87%|βββββββββ | 542/625 [07:33<01:16, 1.08it/s]
Training 1/1 epoch (loss 3.1725): 87%|βββββββββ | 543/625 [07:33<01:06, 1.24it/s]
Training 1/1 epoch (loss 3.3227): 87%|βββββββββ | 543/625 [07:35<01:06, 1.24it/s]
Training 1/1 epoch (loss 3.3227): 87%|βββββββββ | 544/625 [07:35<01:20, 1.01it/s]
Training 1/1 epoch (loss 3.4802): 87%|βββββββββ | 544/625 [07:36<01:20, 1.01it/s]
Training 1/1 epoch (loss 3.4802): 87%|βββββββββ | 545/625 [07:36<01:24, 1.06s/it]
Training 1/1 epoch (loss 3.3037): 87%|βββββββββ | 545/625 [07:36<01:24, 1.06s/it]
Training 1/1 epoch (loss 3.3037): 87%|βββββββββ | 546/625 [07:36<01:10, 1.13it/s]
Training 1/1 epoch (loss 3.0630): 87%|βββββββββ | 546/625 [07:37<01:10, 1.13it/s]
Training 1/1 epoch (loss 3.0630): 88%|βββββββββ | 547/625 [07:37<01:04, 1.22it/s]
Training 1/1 epoch (loss 2.8892): 88%|βββββββββ | 547/625 [07:38<01:04, 1.22it/s]
Training 1/1 epoch (loss 2.8892): 88%|βββββββββ | 548/625 [07:38<01:04, 1.19it/s]
Training 1/1 epoch (loss 3.2042): 88%|βββββββββ | 548/625 [07:39<01:04, 1.19it/s]
Training 1/1 epoch (loss 3.2042): 88%|βββββββββ | 549/625 [07:39<01:02, 1.22it/s]
Training 1/1 epoch (loss 3.0343): 88%|βββββββββ | 549/625 [07:39<01:02, 1.22it/s]
Training 1/1 epoch (loss 3.0343): 88%|βββββββββ | 550/625 [07:39<01:01, 1.22it/s]
Training 1/1 epoch (loss 3.3755): 88%|βββββββββ | 550/625 [07:40<01:01, 1.22it/s]
Training 1/1 epoch (loss 3.3755): 88%|βββββββββ | 551/625 [07:40<01:03, 1.17it/s]
Training 1/1 epoch (loss 3.0532): 88%|βββββββββ | 551/625 [07:42<01:03, 1.17it/s]
Training 1/1 epoch (loss 3.0532): 88%|βββββββββ | 552/625 [07:42<01:11, 1.03it/s]
Training 1/1 epoch (loss 3.0322): 88%|βββββββββ | 552/625 [07:42<01:11, 1.03it/s]
Training 1/1 epoch (loss 3.0322): 88%|βββββββββ | 553/625 [07:42<00:57, 1.25it/s]
Training 1/1 epoch (loss 3.1450): 88%|βββββββββ | 553/625 [07:43<00:57, 1.25it/s]
Training 1/1 epoch (loss 3.1450): 89%|βββββββββ | 554/625 [07:43<00:59, 1.20it/s]
Training 1/1 epoch (loss 3.2258): 89%|βββββββββ | 554/625 [07:44<00:59, 1.20it/s]
Training 1/1 epoch (loss 3.2258): 89%|βββββββββ | 555/625 [07:44<01:00, 1.17it/s]
Training 1/1 epoch (loss 2.9476): 89%|βββββββββ | 555/625 [07:44<01:00, 1.17it/s]
Training 1/1 epoch (loss 2.9476): 89%|βββββββββ | 556/625 [07:44<00:52, 1.31it/s]
Training 1/1 epoch (loss 3.1235): 89%|βββββββββ | 556/625 [07:45<00:52, 1.31it/s]
Training 1/1 epoch (loss 3.1235): 89%|βββββββββ | 557/625 [07:45<00:53, 1.28it/s]
Training 1/1 epoch (loss 3.2506): 89%|βββββββββ | 557/625 [07:46<00:53, 1.28it/s]
Training 1/1 epoch (loss 3.2506): 89%|βββββββββ | 558/625 [07:46<00:55, 1.22it/s]
Training 1/1 epoch (loss 2.9898): 89%|βββββββββ | 558/625 [07:47<00:55, 1.22it/s]
Training 1/1 epoch (loss 2.9898): 89%|βββββββββ | 559/625 [07:47<00:55, 1.19it/s]
Training 1/1 epoch (loss 3.0849): 89%|βββββββββ | 559/625 [07:48<00:55, 1.19it/s]
Training 1/1 epoch (loss 3.0849): 90%|βββββββββ | 560/625 [07:48<00:55, 1.17it/s]
Training 1/1 epoch (loss 3.2233): 90%|βββββββββ | 560/625 [07:49<00:55, 1.17it/s]
Training 1/1 epoch (loss 3.2233): 90%|βββββββββ | 561/625 [07:49<00:54, 1.18it/s]
Training 1/1 epoch (loss 3.2476): 90%|βββββββββ | 561/625 [07:50<00:54, 1.18it/s]
Training 1/1 epoch (loss 3.2476): 90%|βββββββββ | 562/625 [07:50<00:54, 1.15it/s]
Training 1/1 epoch (loss 3.1971): 90%|βββββββββ | 562/625 [07:50<00:54, 1.15it/s]
Training 1/1 epoch (loss 3.1971): 90%|βββββββββ | 563/625 [07:50<00:45, 1.38it/s]
Training 1/1 epoch (loss 2.8237): 90%|βββββββββ | 563/625 [07:51<00:45, 1.38it/s]
Training 1/1 epoch (loss 2.8237): 90%|βββββββββ | 564/625 [07:51<00:47, 1.28it/s]
Training 1/1 epoch (loss 3.1508): 90%|βββββββββ | 564/625 [07:52<00:47, 1.28it/s]
Training 1/1 epoch (loss 3.1508): 90%|βββββββββ | 565/625 [07:52<00:55, 1.08it/s]
Training 1/1 epoch (loss 3.1138): 90%|βββββββββ | 565/625 [07:53<00:55, 1.08it/s]
Training 1/1 epoch (loss 3.1138): 91%|βββββββββ | 566/625 [07:53<00:48, 1.22it/s]
Training 1/1 epoch (loss 2.9585): 91%|βββββββββ | 566/625 [07:54<00:48, 1.22it/s]
Training 1/1 epoch (loss 2.9585): 91%|βββββββββ | 567/625 [07:54<00:45, 1.27it/s]
Training 1/1 epoch (loss 3.2146): 91%|βββββββββ | 567/625 [07:55<00:45, 1.27it/s]
Training 1/1 epoch (loss 3.2146): 91%|βββββββββ | 568/625 [07:55<00:50, 1.14it/s]
Training 1/1 epoch (loss 3.4083): 91%|βββββββββ | 568/625 [07:56<00:50, 1.14it/s]
Training 1/1 epoch (loss 3.4083): 91%|βββββββββ | 569/625 [07:56<00:49, 1.12it/s]
Training 1/1 epoch (loss 3.4235): 91%|βββββββββ | 569/625 [07:57<00:49, 1.12it/s]
Training 1/1 epoch (loss 3.4235): 91%|βββββββββ | 570/625 [07:57<00:53, 1.03it/s]
Training 1/1 epoch (loss 3.3758): 91%|βββββββββ | 570/625 [07:58<00:53, 1.03it/s]
Training 1/1 epoch (loss 3.3758): 91%|ββββββββββ| 571/625 [07:58<01:03, 1.18s/it]
Training 1/1 epoch (loss 3.4513): 91%|ββββββββββ| 571/625 [07:59<01:03, 1.18s/it]
Training 1/1 epoch (loss 3.4513): 92%|ββββββββββ| 572/625 [07:59<00:51, 1.04it/s]
Training 1/1 epoch (loss 3.1768): 92%|ββββββββββ| 572/625 [08:00<00:51, 1.04it/s]
Training 1/1 epoch (loss 3.1768): 92%|ββββββββββ| 573/625 [08:00<00:50, 1.03it/s]
Training 1/1 epoch (loss 3.0347): 92%|ββββββββββ| 573/625 [08:01<00:50, 1.03it/s]
Training 1/1 epoch (loss 3.0347): 92%|ββββββββββ| 574/625 [08:01<00:50, 1.01it/s]
Training 1/1 epoch (loss 2.9312): 92%|ββββββββββ| 574/625 [08:01<00:50, 1.01it/s]
Training 1/1 epoch (loss 2.9312): 92%|ββββββββββ| 575/625 [08:01<00:42, 1.18it/s]
Training 1/1 epoch (loss 2.9049): 92%|ββββββββββ| 575/625 [08:03<00:42, 1.18it/s]
Training 1/1 epoch (loss 2.9049): 92%|ββββββββββ| 576/625 [08:03<00:57, 1.17s/it]
Training 1/1 epoch (loss 3.1063): 92%|ββββββββββ| 576/625 [08:04<00:57, 1.17s/it]
Training 1/1 epoch (loss 3.1063): 92%|ββββββββββ| 577/625 [08:04<00:54, 1.13s/it]
Training 1/1 epoch (loss 3.1599): 92%|ββββββββββ| 577/625 [08:05<00:54, 1.13s/it]
Training 1/1 epoch (loss 3.1599): 92%|ββββββββββ| 578/625 [08:05<00:44, 1.05it/s]
Training 1/1 epoch (loss 3.3452): 92%|ββββββββββ| 578/625 [08:06<00:44, 1.05it/s]
Training 1/1 epoch (loss 3.3452): 93%|ββββββββββ| 579/625 [08:06<00:48, 1.06s/it]
Training 1/1 epoch (loss 3.0319): 93%|ββββββββββ| 579/625 [08:07<00:48, 1.06s/it]
Training 1/1 epoch (loss 3.0319): 93%|ββββββββββ| 580/625 [08:07<00:47, 1.06s/it]
Training 1/1 epoch (loss 3.0836): 93%|ββββββββββ| 580/625 [08:08<00:47, 1.06s/it]
Training 1/1 epoch (loss 3.0836): 93%|ββββββββββ| 581/625 [08:08<00:41, 1.05it/s]
Training 1/1 epoch (loss 3.2674): 93%|ββββββββββ| 581/625 [08:09<00:41, 1.05it/s]
Training 1/1 epoch (loss 3.2674): 93%|ββββββββββ| 582/625 [08:09<00:40, 1.06it/s]
Training 1/1 epoch (loss 3.3132): 93%|ββββββββββ| 582/625 [08:10<00:40, 1.06it/s]
Training 1/1 epoch (loss 3.3132): 93%|ββββββββββ| 583/625 [08:10<00:38, 1.09it/s]
Training 1/1 epoch (loss 2.8566): 93%|ββββββββββ| 583/625 [08:11<00:38, 1.09it/s]
Training 1/1 epoch (loss 2.8566): 93%|ββββββββββ| 584/625 [08:11<00:36, 1.13it/s]
Training 1/1 epoch (loss 3.1260): 93%|ββββββββββ| 584/625 [08:11<00:36, 1.13it/s]
Training 1/1 epoch (loss 3.1260): 94%|ββββββββββ| 585/625 [08:11<00:30, 1.32it/s]
Training 1/1 epoch (loss 2.9971): 94%|ββββββββββ| 585/625 [08:12<00:30, 1.32it/s]
Training 1/1 epoch (loss 2.9971): 94%|ββββββββββ| 586/625 [08:12<00:31, 1.25it/s]
Training 1/1 epoch (loss 3.3030): 94%|ββββββββββ| 586/625 [08:13<00:31, 1.25it/s]
Training 1/1 epoch (loss 3.3030): 94%|ββββββββββ| 587/625 [08:13<00:31, 1.20it/s]
Training 1/1 epoch (loss 3.1697): 94%|ββββββββββ| 587/625 [08:13<00:31, 1.20it/s]
Training 1/1 epoch (loss 3.1697): 94%|ββββββββββ| 588/625 [08:13<00:29, 1.26it/s]
Training 1/1 epoch (loss 3.2314): 94%|ββββββββββ| 588/625 [08:14<00:29, 1.26it/s]
Training 1/1 epoch (loss 3.2314): 94%|ββββββββββ| 589/625 [08:14<00:29, 1.22it/s]
Training 1/1 epoch (loss 3.0275): 94%|ββββββββββ| 589/625 [08:16<00:29, 1.22it/s]
Training 1/1 epoch (loss 3.0275): 94%|ββββββββββ| 590/625 [08:16<00:34, 1.00it/s]
Training 1/1 epoch (loss 3.3977): 94%|ββββββββββ| 590/625 [08:17<00:34, 1.00it/s]
Training 1/1 epoch (loss 3.3977): 95%|ββββββββββ| 591/625 [08:17<00:31, 1.09it/s]
Training 1/1 epoch (loss 3.0458): 95%|ββββββββββ| 591/625 [08:18<00:31, 1.09it/s]
Training 1/1 epoch (loss 3.0458): 95%|ββββββββββ| 592/625 [08:18<00:34, 1.06s/it]
Training 1/1 epoch (loss 3.1764): 95%|ββββββββββ| 592/625 [08:19<00:34, 1.06s/it]
Training 1/1 epoch (loss 3.1764): 95%|ββββββββββ| 593/625 [08:19<00:34, 1.07s/it]
Training 1/1 epoch (loss 3.1561): 95%|ββββββββββ| 593/625 [08:20<00:34, 1.07s/it]
Training 1/1 epoch (loss 3.1561): 95%|ββββββββββ| 594/625 [08:20<00:31, 1.00s/it]
Training 1/1 epoch (loss 2.9824): 95%|ββββββββββ| 594/625 [08:20<00:31, 1.00s/it]
Training 1/1 epoch (loss 2.9824): 95%|ββββββββββ| 595/625 [08:20<00:25, 1.18it/s]
Training 1/1 epoch (loss 3.0605): 95%|ββββββββββ| 595/625 [08:21<00:25, 1.18it/s]
Training 1/1 epoch (loss 3.0605): 95%|ββββββββββ| 596/625 [08:21<00:22, 1.29it/s]
Training 1/1 epoch (loss 3.0079): 95%|ββββββββββ| 596/625 [08:22<00:22, 1.29it/s]
Training 1/1 epoch (loss 3.0079): 96%|ββββββββββ| 597/625 [08:22<00:23, 1.19it/s]
Training 1/1 epoch (loss 3.1418): 96%|ββββββββββ| 597/625 [08:22<00:23, 1.19it/s]
Training 1/1 epoch (loss 3.1418): 96%|ββββββββββ| 598/625 [08:22<00:20, 1.33it/s]
Training 1/1 epoch (loss 3.2765): 96%|ββββββββββ| 598/625 [08:23<00:20, 1.33it/s]
Training 1/1 epoch (loss 3.2765): 96%|ββββββββββ| 599/625 [08:23<00:19, 1.36it/s]
Training 1/1 epoch (loss 3.2248): 96%|ββββββββββ| 599/625 [08:24<00:19, 1.36it/s]
Training 1/1 epoch (loss 3.2248): 96%|ββββββββββ| 600/625 [08:24<00:20, 1.24it/s]
Training 1/1 epoch (loss 3.0850): 96%|ββββββββββ| 600/625 [08:25<00:20, 1.24it/s]
Training 1/1 epoch (loss 3.0850): 96%|ββββββββββ| 601/625 [08:25<00:21, 1.12it/s]
Training 1/1 epoch (loss 3.1071): 96%|ββββββββββ| 601/625 [08:26<00:21, 1.12it/s]
Training 1/1 epoch (loss 3.1071): 96%|ββββββββββ| 602/625 [08:26<00:17, 1.34it/s]
Training 1/1 epoch (loss 3.0168): 96%|ββββββββββ| 602/625 [08:27<00:17, 1.34it/s]
Training 1/1 epoch (loss 3.0168): 96%|ββββββββββ| 603/625 [08:27<00:17, 1.27it/s]
Training 1/1 epoch (loss 3.1504): 96%|ββββββββββ| 603/625 [08:27<00:17, 1.27it/s]
Training 1/1 epoch (loss 3.1504): 97%|ββββββββββ| 604/625 [08:27<00:16, 1.26it/s]
Training 1/1 epoch (loss 3.2647): 97%|ββββββββββ| 604/625 [08:28<00:16, 1.26it/s]
Training 1/1 epoch (loss 3.2647): 97%|ββββββββββ| 605/625 [08:28<00:15, 1.30it/s]
Training 1/1 epoch (loss 3.0867): 97%|ββββββββββ| 605/625 [08:29<00:15, 1.30it/s]
Training 1/1 epoch (loss 3.0867): 97%|ββββββββββ| 606/625 [08:29<00:13, 1.40it/s]
Training 1/1 epoch (loss 3.2044): 97%|ββββββββββ| 606/625 [08:30<00:13, 1.40it/s]
Training 1/1 epoch (loss 3.2044): 97%|ββββββββββ| 607/625 [08:30<00:13, 1.30it/s]
Training 1/1 epoch (loss 3.2112): 97%|ββββββββββ| 607/625 [08:31<00:13, 1.30it/s]
Training 1/1 epoch (loss 3.2112): 97%|ββββββββββ| 608/625 [08:31<00:14, 1.17it/s]
Training 1/1 epoch (loss 3.2263): 97%|ββββββββββ| 608/625 [08:31<00:14, 1.17it/s]
Training 1/1 epoch (loss 3.2263): 97%|ββββββββββ| 609/625 [08:31<00:11, 1.35it/s]
Training 1/1 epoch (loss 3.1086): 97%|ββββββββββ| 609/625 [08:32<00:11, 1.35it/s]
Training 1/1 epoch (loss 3.1086): 98%|ββββββββββ| 610/625 [08:32<00:11, 1.34it/s]
Training 1/1 epoch (loss 3.1664): 98%|ββββββββββ| 610/625 [08:33<00:11, 1.34it/s]
Training 1/1 epoch (loss 3.1664): 98%|ββββββββββ| 611/625 [08:33<00:10, 1.28it/s]
Training 1/1 epoch (loss 3.3633): 98%|ββββββββββ| 611/625 [08:33<00:10, 1.28it/s]
Training 1/1 epoch (loss 3.3633): 98%|ββββββββββ| 612/625 [08:33<00:10, 1.30it/s]
Training 1/1 epoch (loss 3.0807): 98%|ββββββββββ| 612/625 [08:34<00:10, 1.30it/s]
Training 1/1 epoch (loss 3.0807): 98%|ββββββββββ| 613/625 [08:34<00:08, 1.37it/s]
Training 1/1 epoch (loss 3.3911): 98%|ββββββββββ| 613/625 [08:35<00:08, 1.37it/s]
Training 1/1 epoch (loss 3.3911): 98%|ββββββββββ| 614/625 [08:35<00:07, 1.38it/s]
Training 1/1 epoch (loss 3.0224): 98%|ββββββββββ| 614/625 [08:36<00:07, 1.38it/s]
Training 1/1 epoch (loss 3.0224): 98%|ββββββββββ| 615/625 [08:36<00:07, 1.26it/s]
Training 1/1 epoch (loss 3.0853): 98%|ββββββββββ| 615/625 [08:37<00:07, 1.26it/s]
Training 1/1 epoch (loss 3.0853): 99%|ββββββββββ| 616/625 [08:37<00:07, 1.22it/s]
Training 1/1 epoch (loss 3.3323): 99%|ββββββββββ| 616/625 [08:38<00:07, 1.22it/s]
Training 1/1 epoch (loss 3.3323): 99%|ββββββββββ| 617/625 [08:38<00:07, 1.11it/s]
Training 1/1 epoch (loss 3.2907): 99%|ββββββββββ| 617/625 [08:39<00:07, 1.11it/s]
Training 1/1 epoch (loss 3.2907): 99%|ββββββββββ| 618/625 [08:39<00:06, 1.11it/s]
Training 1/1 epoch (loss 3.0927): 99%|ββββββββββ| 618/625 [08:39<00:06, 1.11it/s]
Training 1/1 epoch (loss 3.0927): 99%|ββββββββββ| 619/625 [08:39<00:05, 1.19it/s]
Training 1/1 epoch (loss 3.0040): 99%|ββββββββββ| 619/625 [08:40<00:05, 1.19it/s]
Training 1/1 epoch (loss 3.0040): 99%|ββββββββββ| 620/625 [08:40<00:04, 1.21it/s]
Training 1/1 epoch (loss 3.3394): 99%|ββββββββββ| 620/625 [08:41<00:04, 1.21it/s]
Training 1/1 epoch (loss 3.3394): 99%|ββββββββββ| 621/625 [08:41<00:03, 1.12it/s]
Training 1/1 epoch (loss 3.1001): 99%|ββββββββββ| 621/625 [08:42<00:03, 1.12it/s]
Training 1/1 epoch (loss 3.1001): 100%|ββββββββββ| 622/625 [08:42<00:02, 1.19it/s]
Training 1/1 epoch (loss 3.3071): 100%|ββββββββββ| 622/625 [08:42<00:02, 1.19it/s]
Training 1/1 epoch (loss 3.3071): 100%|ββββββββββ| 623/625 [08:42<00:01, 1.34it/s]
Training 1/1 epoch (loss 3.1016): 100%|ββββββββββ| 623/625 [08:43<00:01, 1.34it/s]
Training 1/1 epoch (loss 3.1016): 100%|ββββββββββ| 624/625 [08:43<00:00, 1.26it/s]
Training 1/1 epoch (loss 3.1888): 100%|ββββββββββ| 624/625 [08:44<00:00, 1.26it/s]
Training 1/1 epoch (loss 3.1888): 100%|ββββββββββ| 625/625 [08:44<00:00, 1.10it/s]
Training 1/1 epoch (loss 3.1888): 100%|ββββββββββ| 625/625 [08:44<00:00, 1.19it/s] |