File size: 14,790 Bytes
7726997
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
 > Training Environment:
 | > Backend: Torch
 | > Mixed precision: False
 | > Precision: float32
 | > Current device: 0
 | > Num. of GPUs: 1
 | > Num. of CPUs: 64
 | > Num. of Torch Threads: 1
 | > Torch seed: 1
 | > Torch CUDNN: True
 | > Torch CUDNN deterministic: False
 | > Torch CUDNN benchmark: False
 | > Torch TF32 MatMul: False
 > Start Tensorboard: tensorboard --logdir=/workspace/run/training/GPT_XTTS_v2.0_LJSpeech_FT-April-23-2024_12+18PM-0000000

 > Model has 518442047 parameters

 > EPOCH: 0/1000
 --> /workspace/run/training/GPT_XTTS_v2.0_LJSpeech_FT-April-23-2024_12+18PM-0000000

 > TRAINING (2024-04-23 12:18:40) 

   --> TIME: 2024-04-23 12:18:42 -- STEP: 0/1695 -- GLOBAL_STEP: 0
     | > loss_text_ce: 0.042592838406562805  (0.042592838406562805)
     | > loss_mel_ce: 3.744250535964966  (3.744250535964966)
     | > loss: 0.04508147016167641  (0.04508147016167641)
     | > current_lr: 4e-06 
     | > step_time: 0.3181  (0.3181343078613281)
     | > loader_time: 1.1535  (1.153491735458374)


   --> TIME: 2024-04-23 12:18:50 -- STEP: 50/1695 -- GLOBAL_STEP: 50
     | > loss_text_ce: 0.043245986104011536  (0.045777649357914924)
     | > loss_mel_ce: 4.0826735496521  (3.678379626274109)
     | > loss: 0.04911808669567108  (0.044335206523537646)
     | > current_lr: 4e-06 
     | > step_time: 0.1173  (0.10748531341552735)
     | > loader_time: 0.0038  (0.012436685562133789)


   --> TIME: 2024-04-23 12:19:00 -- STEP: 100/1695 -- GLOBAL_STEP: 100
     | > loss_text_ce: 0.04654935747385025  (0.04617325332015751)
     | > loss_mel_ce: 3.7310783863067627  (3.6352836871147156)
     | > loss: 0.044971760362386703  (0.04382686924189331)
     | > current_lr: 4e-06 
     | > step_time: 0.1229  (0.1165578818321228)
     | > loader_time: 0.0044  (0.010995228290557862)


   --> TIME: 2024-04-23 12:19:10 -- STEP: 150/1695 -- GLOBAL_STEP: 150
     | > loss_text_ce: 0.04864665865898132  (0.04633487790822981)
     | > loss_mel_ce: 3.695878267288208  (3.5984654172261554)
     | > loss: 0.04457768052816391  (0.04339048052827519)
     | > current_lr: 4e-06 
     | > step_time: 0.0968  (0.12075453917185465)
     | > loader_time: 0.0068  (0.009986537297566734)


   --> TIME: 2024-04-23 12:19:21 -- STEP: 200/1695 -- GLOBAL_STEP: 200
     | > loss_text_ce: 0.04507960379123688  (0.04615468136966227)
     | > loss_mel_ce: 3.4362077713012695  (3.5497735607624055)
     | > loss: 0.041443899273872375  (0.042808670215308674)
     | > current_lr: 4e-06 
     | > step_time: 0.1431  (0.12541004419326782)
     | > loader_time: 0.004  (0.009364948272705077)


   --> TIME: 2024-04-23 12:19:33 -- STEP: 250/1695 -- GLOBAL_STEP: 250
     | > loss_text_ce: 0.044978540390729904  (0.04600780452787875)
     | > loss_mel_ce: 3.3835601806640625  (3.5098479528427124)
     | > loss: 0.040815938264131546  (0.04233161683380605)
     | > current_lr: 4e-06 
     | > step_time: 0.1506  (0.12965419387817378)
     | > loader_time: 0.0043  (0.008812045097351074)


   --> TIME: 2024-04-23 12:19:45 -- STEP: 300/1695 -- GLOBAL_STEP: 300
     | > loss_text_ce: 0.04761254042387009  (0.046099709086120134)
     | > loss_mel_ce: 3.859790325164795  (3.4856272101402284)
     | > loss: 0.04651670157909393  (0.04204436879605055)
     | > current_lr: 4e-06 
     | > step_time: 0.107  (0.13299476464589427)
     | > loader_time: 0.0045  (0.008340648015340164)


   --> TIME: 2024-04-23 12:19:57 -- STEP: 350/1695 -- GLOBAL_STEP: 350
     | > loss_text_ce: 0.041058849543333054  (0.04608927173273904)
     | > loss_mel_ce: 3.2493679523468018  (3.4584123958860125)
     | > loss: 0.039171747863292694  (0.04172025864677771)
     | > current_lr: 4e-06 
     | > step_time: 0.1586  (0.1357990046909877)
     | > loader_time: 0.0092  (0.007999198096139085)


   --> TIME: 2024-04-23 12:20:09 -- STEP: 400/1695 -- GLOBAL_STEP: 400
     | > loss_text_ce: 0.0439525842666626  (0.04606584513559937)
     | > loss_mel_ce: 3.5535271167755127  (3.4283770048618316)
     | > loss: 0.04282714053988457  (0.04136241558939219)
     | > current_lr: 4e-06 
     | > step_time: 0.1314  (0.13862687826156628)
     | > loader_time: 0.0039  (0.007809545397758481)


 > CHECKPOINT : /workspace/run/training/GPT_XTTS_v2.0_LJSpeech_FT-April-23-2024_12+18PM-0000000/checkpoint_400.pth

   --> TIME: 2024-04-23 12:20:24 -- STEP: 450/1695 -- GLOBAL_STEP: 450
     | > loss_text_ce: 0.05098263919353485  (0.04611284121870995)
     | > loss_mel_ce: 2.9446003437042236  (3.401099606090122)
     | > loss: 0.03566170483827591  (0.04103824413898918)
     | > current_lr: 4e-06 
     | > step_time: 0.1581  (0.1393417421976725)
     | > loader_time: 0.0041  (0.0075671084721883105)


   --> TIME: 2024-04-23 12:20:36 -- STEP: 500/1695 -- GLOBAL_STEP: 500
     | > loss_text_ce: 0.03936528041958809  (0.04605886636674404)
     | > loss_mel_ce: 3.534381628036499  (3.3785955691337586)
     | > loss: 0.04254460707306862  (0.04076969639584422)
     | > current_lr: 4e-06 
     | > step_time: 0.1362  (0.1412919845581054)
     | > loader_time: 0.0044  (0.007305326461791989)


   --> TIME: 2024-04-23 12:20:49 -- STEP: 550/1695 -- GLOBAL_STEP: 550
     | > loss_text_ce: 0.043622393161058426  (0.04607608911666003)
     | > loss_mel_ce: 3.36867618560791  (3.351200197826734)
     | > loss: 0.04062260314822197  (0.04044376604597676)
     | > current_lr: 4e-06 
     | > step_time: 0.1491  (0.1432263898849487)
     | > loader_time: 0.0044  (0.007147459983825681)


   --> TIME: 2024-04-23 12:21:01 -- STEP: 600/1695 -- GLOBAL_STEP: 600
     | > loss_text_ce: 0.04180557280778885  (0.04603437863911191)
     | > loss_mel_ce: 3.1069161891937256  (3.328243460655214)
     | > loss: 0.0374847836792469  (0.04016997500322759)
     | > current_lr: 4e-06 
     | > step_time: 0.1583  (0.14447109142939243)
     | > loader_time: 0.0047  (0.006965583960215248)


   --> TIME: 2024-04-23 12:21:14 -- STEP: 650/1695 -- GLOBAL_STEP: 650
     | > loss_text_ce: 0.04896671324968338  (0.04602042846381666)
     | > loss_mel_ce: 3.0476784706115723  (3.3038424359835123)
     | > loss: 0.03686482459306717  (0.03987932055042337)
     | > current_lr: 4e-06 
     | > step_time: 0.1219  (0.14626641933734613)
     | > loader_time: 0.0047  (0.006803958232586197)


   --> TIME: 2024-04-23 12:21:27 -- STEP: 700/1695 -- GLOBAL_STEP: 700
     | > loss_text_ce: 0.04512707144021988  (0.046030817106366195)
     | > loss_mel_ce: 3.066598892211914  (3.2816116438593195)
     | > loss: 0.037044357508420944  (0.03961479195526668)
     | > current_lr: 4e-06 
     | > step_time: 0.1502  (0.14775025640215206)
     | > loader_time: 0.0044  (0.006717268739427837)


   --> TIME: 2024-04-23 12:21:40 -- STEP: 750/1695 -- GLOBAL_STEP: 750
     | > loss_text_ce: 0.04244884476065636  (0.04599520656466488)
     | > loss_mel_ce: 2.8379921913146973  (3.264411670366924)
     | > loss: 0.034290965646505356  (0.03940960643688838)
     | > current_lr: 4e-06 
     | > step_time: 0.218  (0.14881795597076428)
     | > loader_time: 0.0049  (0.006605740865071612)


   --> TIME: 2024-04-23 12:21:53 -- STEP: 800/1695 -- GLOBAL_STEP: 800
     | > loss_text_ce: 0.04257930815219879  (0.04597263523377482)
     | > loss_mel_ce: 2.8074073791503906  (3.2470336309075365)
     | > loss: 0.033928416669368744  (0.03920245631132278)
     | > current_lr: 4e-06 
     | > step_time: 0.151  (0.14975822657346743)
     | > loader_time: 0.0045  (0.006505406498908994)


 > CHECKPOINT : /workspace/run/training/GPT_XTTS_v2.0_LJSpeech_FT-April-23-2024_12+18PM-0000000/checkpoint_800.pth

   --> TIME: 2024-04-23 12:22:08 -- STEP: 850/1695 -- GLOBAL_STEP: 850
     | > loss_text_ce: 0.046279508620500565  (0.04595626743400801)
     | > loss_mel_ce: 2.9114205837249756  (3.232562539998224)
     | > loss: 0.03521071374416351  (0.0390299865691101)
     | > current_lr: 4e-06 
     | > step_time: 0.1686  (0.14989260813769187)
     | > loader_time: 0.0041  (0.006382525107439824)


   --> TIME: 2024-04-23 12:22:21 -- STEP: 900/1695 -- GLOBAL_STEP: 900
     | > loss_text_ce: 0.04815426096320152  (0.045925861448049575)
     | > loss_mel_ce: 2.881121873855591  (3.21540697336197)
     | > loss: 0.03487233817577362  (0.03882539166758459)
     | > current_lr: 4e-06 
     | > step_time: 0.1067  (0.15040262672636256)
     | > loader_time: 0.0163  (0.006299734380510116)


   --> TIME: 2024-04-23 12:22:33 -- STEP: 950/1695 -- GLOBAL_STEP: 950
     | > loss_text_ce: 0.046194590628147125  (0.045895876723684795)
     | > loss_mel_ce: 2.452665328979492  (3.2002050801327364)
     | > loss: 0.029748331755399704  (0.03864405976706431)
     | > current_lr: 4e-06 
     | > step_time: 0.1483  (0.1510076773794075)
     | > loader_time: 0.0041  (0.006192966511375024)


   --> TIME: 2024-04-23 12:22:46 -- STEP: 1000/1695 -- GLOBAL_STEP: 1000
     | > loss_text_ce: 0.04607674479484558  (0.04585176565870645)
     | > loss_mel_ce: 2.9387059211730957  (3.187430265903474)
     | > loss: 0.035533126443624496  (0.03849145351536573)
     | > current_lr: 4e-06 
     | > step_time: 0.1648  (0.15175995016098034)
     | > loader_time: 0.0044  (0.006122385978698729)


   --> TIME: 2024-04-23 12:22:59 -- STEP: 1050/1695 -- GLOBAL_STEP: 1050
     | > loss_text_ce: 0.0466134138405323  (0.045852795899623947)
     | > loss_mel_ce: 2.9738194942474365  (3.172598667598907)
     | > loss: 0.03595753759145737  (0.03831489915826492)
     | > current_lr: 4e-06 
     | > step_time: 0.1396  (0.15251214708600735)
     | > loader_time: 0.0049  (0.00607424667903355)


   --> TIME: 2024-04-23 12:23:12 -- STEP: 1100/1695 -- GLOBAL_STEP: 1100
     | > loss_text_ce: 0.04659873992204666  (0.04585135899822824)
     | > loss_mel_ce: 2.4221293926239014  (3.1576039728251373)
     | > loss: 0.029389619827270508  (0.038136373775249206)
     | > current_lr: 4e-06 
     | > step_time: 0.186  (0.15299982179294946)
     | > loader_time: 0.0048  (0.006017334894700482)


   --> TIME: 2024-04-23 12:23:25 -- STEP: 1150/1695 -- GLOBAL_STEP: 1150
     | > loss_text_ce: 0.043769825249910355  (0.045824583464342636)
     | > loss_mel_ce: 2.859921455383301  (3.1463320172351352)
     | > loss: 0.034567754715681076  (0.03800186507079915)
     | > current_lr: 4e-06 
     | > step_time: 0.1919  (0.15354946157206664)
     | > loader_time: 0.0045  (0.005964664376300311)


   --> TIME: 2024-04-23 12:23:38 -- STEP: 1200/1695 -- GLOBAL_STEP: 1200
     | > loss_text_ce: 0.04848972707986832  (0.0457837945688516)
     | > loss_mel_ce: 2.9194998741149902  (3.13719070851803)
     | > loss: 0.035333212465047836  (0.037892554394590376)
     | > current_lr: 4e-06 
     | > step_time: 0.2642  (0.15420349061489103)
     | > loader_time: 0.0046  (0.005914180874824522)


 > CHECKPOINT : /workspace/run/training/GPT_XTTS_v2.0_LJSpeech_FT-April-23-2024_12+18PM-0000000/checkpoint_1200.pth

   --> TIME: 2024-04-23 12:23:54 -- STEP: 1250/1695 -- GLOBAL_STEP: 1250
     | > loss_text_ce: 0.044037092477083206  (0.04573154278397562)
     | > loss_mel_ce: 2.6508209705352783  (3.125970713424685)
     | > loss: 0.03208164498209953  (0.03775836098492144)
     | > current_lr: 4e-06 
     | > step_time: 0.1472  (0.15428158359527583)
     | > loader_time: 0.0044  (0.0058847253799438485)


   --> TIME: 2024-04-23 12:24:07 -- STEP: 1300/1695 -- GLOBAL_STEP: 1300
     | > loss_text_ce: 0.04510482773184776  (0.04571826306959757)
     | > loss_mel_ce: 3.4077906608581543  (3.1158635647480315)
     | > loss: 0.04110589995980263  (0.03763787967797653)
     | > current_lr: 4e-06 
     | > step_time: 0.1634  (0.1545918438984798)
     | > loader_time: 0.0047  (0.00584103455910316)


   --> TIME: 2024-04-23 12:24:19 -- STEP: 1350/1695 -- GLOBAL_STEP: 1350
     | > loss_text_ce: 0.0476665161550045  (0.0456638012136574)
     | > loss_mel_ce: 2.8584489822387695  (3.1040570794211506)
     | > loss: 0.03459661453962326  (0.03749667791994631)
     | > current_lr: 4e-06 
     | > step_time: 0.1331  (0.15490423820636887)
     | > loader_time: 0.0044  (0.0058183479309082044)


   --> TIME: 2024-04-23 12:24:32 -- STEP: 1400/1695 -- GLOBAL_STEP: 1400
     | > loss_text_ce: 0.04452496021986008  (0.04561551003051656)
     | > loss_mel_ce: 3.234622001647949  (3.0916232017108385)
     | > loss: 0.03903746232390404  (0.03734808066327656)
     | > current_lr: 4e-06 
     | > step_time: 0.1215  (0.15542454413005272)
     | > loader_time: 0.0045  (0.005781678301947453)


   --> TIME: 2024-04-23 12:24:45 -- STEP: 1450/1695 -- GLOBAL_STEP: 1450
     | > loss_text_ce: 0.042180027812719345  (0.04556434659608479)
     | > loss_mel_ce: 2.699432134628296  (3.080473143150068)
     | > loss: 0.03263824060559273  (0.037214732784135576)
     | > current_lr: 4e-06 
     | > step_time: 0.167  (0.15565427286871544)
     | > loader_time: 0.0044  (0.005737697502662392)


   --> TIME: 2024-04-23 12:24:57 -- STEP: 1500/1695 -- GLOBAL_STEP: 1500
     | > loss_text_ce: 0.04820888489484787  (0.04552951066195965)
     | > loss_mel_ce: 2.6011390686035156  (3.0704109377861033)
     | > loss: 0.031539857387542725  (0.037094529901941585)
     | > current_lr: 4e-06 
     | > step_time: 0.1594  (0.15576313861211125)
     | > loader_time: 0.0044  (0.005703491051991777)


   --> TIME: 2024-04-23 12:25:11 -- STEP: 1550/1695 -- GLOBAL_STEP: 1550
     | > loss_text_ce: 0.045843496918678284  (0.04549794487655163)
     | > loss_mel_ce: 2.6503143310546875  (3.059678205059422)
     | > loss: 0.032097119837999344  (0.036966383488428164)
     | > current_lr: 4e-06 
     | > step_time: 0.1592  (0.15621070800289008)
     | > loader_time: 0.0046  (0.005665828643306605)


   --> TIME: 2024-04-23 12:25:24 -- STEP: 1600/1695 -- GLOBAL_STEP: 1600
     | > loss_text_ce: 0.04320811480283737  (0.045465721643995496)
     | > loss_mel_ce: 2.5281929969787598  (3.049819415509702)
     | > loss: 0.0306119192391634  (0.03684863331844095)
     | > current_lr: 4e-06 
     | > step_time: 0.1849  (0.15655839025974236)
     | > loader_time: 0.0047  (0.005632958114147183)


 > CHECKPOINT : /workspace/run/training/GPT_XTTS_v2.0_LJSpeech_FT-April-23-2024_12+18PM-0000000/checkpoint_1600.pth

   --> TIME: 2024-04-23 12:25:39 -- STEP: 1650/1695 -- GLOBAL_STEP: 1650
     | > loss_text_ce: 0.04909869655966759  (0.04544659794957349)
     | > loss_mel_ce: 2.6178195476531982  (3.039313706195717)
     | > loss: 0.03174902871251106  (0.036723337676940526)
     | > current_lr: 4e-06 
     | > step_time: 0.1669  (0.1564438345938015)
     | > loader_time: 0.0042  (0.005605537674643773)

 ! Run is kept in /workspace/run/training/GPT_XTTS_v2.0_LJSpeech_FT-April-23-2024_12+18PM-0000000