zemaia committed on
Commit 29a034d · verified · 1 Parent(s): a6bce45

End of training

Files changed (2)
  1. README.md +247 -260
  2. model.safetensors +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [adalbertojunior/distilbert-portuguese-cased](https://huggingface.co/adalbertojunior/distilbert-portuguese-cased) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.6466
19
 
20
  ## Model description
21
 
@@ -47,265 +47,252 @@ The following hyperparameters were used during training:
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:--------:|:-----:|:---------------:|
50
- | 6.8123 | 1.3889 | 100 | 5.5177 |
51
- | 5.1647 | 2.7778 | 200 | 4.6195 |
52
- | 4.4717 | 4.1667 | 300 | 4.0395 |
53
- | 4.0232 | 5.5556 | 400 | 3.6607 |
54
- | 3.6917 | 6.9444 | 500 | 3.3826 |
55
- | 3.4525 | 8.3333 | 600 | 3.1628 |
56
- | 3.2549 | 9.7222 | 700 | 3.0003 |
57
- | 3.0811 | 11.1111 | 800 | 2.8633 |
58
- | 2.959 | 12.5 | 900 | 2.7506 |
59
- | 2.8471 | 13.8889 | 1000 | 2.6297 |
60
- | 2.7321 | 15.2778 | 1100 | 2.5441 |
61
- | 2.6444 | 16.6667 | 1200 | 2.4690 |
62
- | 2.5641 | 18.0556 | 1300 | 2.3772 |
63
- | 2.4889 | 19.4444 | 1400 | 2.3022 |
64
- | 2.4214 | 20.8333 | 1500 | 2.2521 |
65
- | 2.3677 | 22.2222 | 1600 | 2.2045 |
66
- | 2.3108 | 23.6111 | 1700 | 2.1531 |
67
- | 2.2519 | 25.0 | 1800 | 2.1167 |
68
- | 2.2159 | 26.3889 | 1900 | 2.0711 |
69
- | 2.1751 | 27.7778 | 2000 | 2.0200 |
70
- | 2.1338 | 29.1667 | 2100 | 1.9792 |
71
- | 2.092 | 30.5556 | 2200 | 1.9560 |
72
- | 2.0469 | 31.9444 | 2300 | 1.9302 |
73
- | 2.0119 | 33.3333 | 2400 | 1.8737 |
74
- | 1.9751 | 34.7222 | 2500 | 1.8639 |
75
- | 1.9557 | 36.1111 | 2600 | 1.8357 |
76
- | 1.9265 | 37.5 | 2700 | 1.8006 |
77
- | 1.8883 | 38.8889 | 2800 | 1.7937 |
78
- | 1.862 | 40.2778 | 2900 | 1.7344 |
79
- | 1.8457 | 41.6667 | 3000 | 1.7238 |
80
- | 1.811 | 43.0556 | 3100 | 1.7025 |
81
- | 1.7889 | 44.4444 | 3200 | 1.6837 |
82
- | 1.7656 | 45.8333 | 3300 | 1.6712 |
83
- | 1.7372 | 47.2222 | 3400 | 1.6261 |
84
- | 1.7189 | 48.6111 | 3500 | 1.6136 |
85
- | 1.6957 | 50.0 | 3600 | 1.6015 |
86
- | 1.6774 | 51.3889 | 3700 | 1.5803 |
87
- | 1.6551 | 52.7778 | 3800 | 1.5728 |
88
- | 1.638 | 54.1667 | 3900 | 1.5398 |
89
- | 1.6161 | 55.5556 | 4000 | 1.5423 |
90
- | 1.5986 | 56.9444 | 4100 | 1.5037 |
91
- | 1.5852 | 58.3333 | 4200 | 1.4801 |
92
- | 1.5718 | 59.7222 | 4300 | 1.4826 |
93
- | 1.5483 | 61.1111 | 4400 | 1.4776 |
94
- | 1.5326 | 62.5 | 4500 | 1.4548 |
95
- | 1.5228 | 63.8889 | 4600 | 1.4442 |
96
- | 1.4965 | 65.2778 | 4700 | 1.4031 |
97
- | 1.4702 | 66.6667 | 4800 | 1.3834 |
98
- | 1.4603 | 68.0556 | 4900 | 1.3778 |
99
- | 1.441 | 69.4444 | 5000 | 1.3707 |
100
- | 1.4263 | 70.8333 | 5100 | 1.3522 |
101
- | 1.4136 | 72.2222 | 5200 | 1.3273 |
102
- | 1.399 | 73.6111 | 5300 | 1.3429 |
103
- | 1.3844 | 75.0 | 5400 | 1.3061 |
104
- | 1.3724 | 76.3889 | 5500 | 1.3003 |
105
- | 1.3596 | 77.7778 | 5600 | 1.2754 |
106
- | 1.3488 | 79.1667 | 5700 | 1.2679 |
107
- | 1.3414 | 80.5556 | 5800 | 1.2614 |
108
- | 1.3335 | 81.9444 | 5900 | 1.2568 |
109
- | 1.3165 | 83.3333 | 6000 | 1.2440 |
110
- | 1.3078 | 84.7222 | 6100 | 1.2387 |
111
- | 1.2914 | 86.1111 | 6200 | 1.2341 |
112
- | 1.2867 | 87.5 | 6300 | 1.2264 |
113
- | 1.2758 | 88.8889 | 6400 | 1.2150 |
114
- | 1.2709 | 90.2778 | 6500 | 1.2056 |
115
- | 1.257 | 91.6667 | 6600 | 1.2121 |
116
- | 1.2455 | 93.0556 | 6700 | 1.1860 |
117
- | 1.2354 | 94.4444 | 6800 | 1.1787 |
118
- | 1.2298 | 95.8333 | 6900 | 1.1604 |
119
- | 1.2202 | 97.2222 | 7000 | 1.1632 |
120
- | 1.2045 | 98.6111 | 7100 | 1.1477 |
121
- | 1.2062 | 100.0 | 7200 | 1.1484 |
122
- | 1.2039 | 101.3889 | 7300 | 1.1493 |
123
- | 1.1851 | 102.7778 | 7400 | 1.1298 |
124
- | 1.1806 | 104.1667 | 7500 | 1.1277 |
125
- | 1.1616 | 105.5556 | 7600 | 1.1080 |
126
- | 1.1614 | 106.9444 | 7700 | 1.1081 |
127
- | 1.1504 | 108.3333 | 7800 | 1.1334 |
128
- | 1.1407 | 109.7222 | 7900 | 1.1024 |
129
- | 1.1318 | 111.1111 | 8000 | 1.0949 |
130
- | 1.1258 | 112.5 | 8100 | 1.0917 |
131
- | 1.1212 | 113.8889 | 8200 | 1.0718 |
132
- | 1.119 | 115.2778 | 8300 | 1.0893 |
133
- | 1.102 | 116.6667 | 8400 | 1.0606 |
134
- | 1.091 | 118.0556 | 8500 | 1.0709 |
135
- | 1.0834 | 119.4444 | 8600 | 1.0493 |
136
- | 1.0964 | 120.8333 | 8700 | 1.0448 |
137
- | 1.0775 | 122.2222 | 8800 | 1.0432 |
138
- | 1.076 | 123.6111 | 8900 | 1.0309 |
139
- | 1.0602 | 125.0 | 9000 | 1.0191 |
140
- | 1.0583 | 126.3889 | 9100 | 1.0346 |
141
- | 1.052 | 127.7778 | 9200 | 1.0326 |
142
- | 1.0416 | 129.1667 | 9300 | 1.0146 |
143
- | 1.0404 | 130.5556 | 9400 | 1.0035 |
144
- | 1.0254 | 131.9444 | 9500 | 1.0022 |
145
- | 1.0302 | 133.3333 | 9600 | 1.0067 |
146
- | 1.0219 | 134.7222 | 9700 | 1.0029 |
147
- | 1.0171 | 136.1111 | 9800 | 0.9713 |
148
- | 1.0043 | 137.5 | 9900 | 0.9969 |
149
- | 1.0014 | 138.8889 | 10000 | 0.9847 |
150
- | 0.9972 | 140.2778 | 10100 | 0.9827 |
151
- | 0.9969 | 141.6667 | 10200 | 0.9771 |
152
- | 0.9848 | 143.0556 | 10300 | 0.9696 |
153
- | 0.9851 | 144.4444 | 10400 | 0.9619 |
154
- | 0.9735 | 145.8333 | 10500 | 0.9598 |
155
- | 0.9652 | 147.2222 | 10600 | 0.9435 |
156
- | 0.9669 | 148.6111 | 10700 | 0.9475 |
157
- | 0.9594 | 150.0 | 10800 | 0.9416 |
158
- | 0.9584 | 151.3889 | 10900 | 0.9433 |
159
- | 0.9486 | 152.7778 | 11000 | 0.9389 |
160
- | 0.9456 | 154.1667 | 11100 | 0.9329 |
161
- | 0.9399 | 155.5556 | 11200 | 0.9354 |
162
- | 0.9265 | 156.9444 | 11300 | 0.9146 |
163
- | 0.9269 | 158.3333 | 11400 | 0.9213 |
164
- | 0.9333 | 159.7222 | 11500 | 0.9171 |
165
- | 0.9222 | 161.1111 | 11600 | 0.9276 |
166
- | 0.9171 | 162.5 | 11700 | 0.9104 |
167
- | 0.9153 | 163.8889 | 11800 | 0.9081 |
168
- | 0.9018 | 165.2778 | 11900 | 0.9064 |
169
- | 0.9097 | 166.6667 | 12000 | 0.8837 |
170
- | 0.8998 | 168.0556 | 12100 | 0.8802 |
171
- | 0.8904 | 169.4444 | 12200 | 0.8866 |
172
- | 0.8876 | 170.8333 | 12300 | 0.8672 |
173
- | 0.8893 | 172.2222 | 12400 | 0.8894 |
174
- | 0.8816 | 173.6111 | 12500 | 0.8660 |
175
- | 0.88 | 175.0 | 12600 | 0.8911 |
176
- | 0.8767 | 176.3889 | 12700 | 0.8532 |
177
- | 0.8651 | 177.7778 | 12800 | 0.8675 |
178
- | 0.8625 | 179.1667 | 12900 | 0.8567 |
179
- | 0.8574 | 180.5556 | 13000 | 0.8608 |
180
- | 0.8591 | 181.9444 | 13100 | 0.8706 |
181
- | 0.8526 | 183.3333 | 13200 | 0.8568 |
182
- | 0.8492 | 184.7222 | 13300 | 0.8423 |
183
- | 0.8481 | 186.1111 | 13400 | 0.8570 |
184
- | 0.8452 | 187.5 | 13500 | 0.8302 |
185
- | 0.841 | 188.8889 | 13600 | 0.8306 |
186
- | 0.8429 | 190.2778 | 13700 | 0.8372 |
187
- | 0.83 | 191.6667 | 13800 | 0.8337 |
188
- | 0.8356 | 193.0556 | 13900 | 0.8261 |
189
- | 0.8318 | 194.4444 | 14000 | 0.8363 |
190
- | 0.8218 | 195.8333 | 14100 | 0.8136 |
191
- | 0.82 | 197.2222 | 14200 | 0.8140 |
192
- | 0.8111 | 198.6111 | 14300 | 0.8330 |
193
- | 0.8128 | 200.0 | 14400 | 0.8203 |
194
- | 0.8082 | 201.3889 | 14500 | 0.8001 |
195
- | 0.8071 | 202.7778 | 14600 | 0.8090 |
196
- | 0.8033 | 204.1667 | 14700 | 0.8148 |
197
- | 0.7964 | 205.5556 | 14800 | 0.7944 |
198
- | 0.7965 | 206.9444 | 14900 | 0.8101 |
199
- | 0.7936 | 208.3333 | 15000 | 0.7992 |
200
- | 0.7838 | 209.7222 | 15100 | 0.8061 |
201
- | 0.7834 | 211.1111 | 15200 | 0.7989 |
202
- | 0.7829 | 212.5 | 15300 | 0.7893 |
203
- | 0.7779 | 213.8889 | 15400 | 0.8032 |
204
- | 0.7761 | 215.2778 | 15500 | 0.7841 |
205
- | 0.7776 | 216.6667 | 15600 | 0.7834 |
206
- | 0.7743 | 218.0556 | 15700 | 0.7865 |
207
- | 0.7696 | 219.4444 | 15800 | 0.7808 |
208
- | 0.7702 | 220.8333 | 15900 | 0.7761 |
209
- | 0.7608 | 222.2222 | 16000 | 0.7916 |
210
- | 0.7571 | 223.6111 | 16100 | 0.7580 |
211
- | 0.7569 | 225.0 | 16200 | 0.7800 |
212
- | 0.7495 | 226.3889 | 16300 | 0.7717 |
213
- | 0.7554 | 227.7778 | 16400 | 0.7718 |
214
- | 0.7455 | 229.1667 | 16500 | 0.7549 |
215
- | 0.7476 | 230.5556 | 16600 | 0.7609 |
216
- | 0.7477 | 231.9444 | 16700 | 0.7813 |
217
- | 0.7495 | 233.3333 | 16800 | 0.7411 |
218
- | 0.7328 | 234.7222 | 16900 | 0.7550 |
219
- | 0.7363 | 236.1111 | 17000 | 0.7476 |
220
- | 0.732 | 237.5 | 17100 | 0.7501 |
221
- | 0.7353 | 238.8889 | 17200 | 0.7566 |
222
- | 0.7294 | 240.2778 | 17300 | 0.7464 |
223
- | 0.7231 | 241.6667 | 17400 | 0.7455 |
224
- | 0.7227 | 243.0556 | 17500 | 0.7385 |
225
- | 0.7225 | 244.4444 | 17600 | 0.7269 |
226
- | 0.7166 | 245.8333 | 17700 | 0.7340 |
227
- | 0.7147 | 247.2222 | 17800 | 0.7361 |
228
- | 0.7158 | 248.6111 | 17900 | 0.7351 |
229
- | 0.7163 | 250.0 | 18000 | 0.7336 |
230
- | 0.7112 | 251.3889 | 18100 | 0.7418 |
231
- | 0.7073 | 252.7778 | 18200 | 0.7328 |
232
- | 0.7067 | 254.1667 | 18300 | 0.7345 |
233
- | 0.7094 | 255.5556 | 18400 | 0.7278 |
234
- | 0.7047 | 256.9444 | 18500 | 0.7147 |
235
- | 0.7006 | 258.3333 | 18600 | 0.7229 |
236
- | 0.6921 | 259.7222 | 18700 | 0.7239 |
237
- | 0.6998 | 261.1111 | 18800 | 0.7226 |
238
- | 0.6939 | 262.5 | 18900 | 0.7211 |
239
- | 0.6934 | 263.8889 | 19000 | 0.7052 |
240
- | 0.6868 | 265.2778 | 19100 | 0.7150 |
241
- | 0.6799 | 266.6667 | 19200 | 0.7285 |
242
- | 0.6835 | 268.0556 | 19300 | 0.7128 |
243
- | 0.6865 | 269.4444 | 19400 | 0.7006 |
244
- | 0.688 | 270.8333 | 19500 | 0.7135 |
245
- | 0.6798 | 272.2222 | 19600 | 0.6953 |
246
- | 0.6746 | 273.6111 | 19700 | 0.7109 |
247
- | 0.6783 | 275.0 | 19800 | 0.7154 |
248
- | 0.6732 | 276.3889 | 19900 | 0.7115 |
249
- | 0.6715 | 277.7778 | 20000 | 0.6976 |
250
- | 0.6702 | 279.1667 | 20100 | 0.6889 |
251
- | 0.6699 | 280.5556 | 20200 | 0.6835 |
252
- | 0.6663 | 281.9444 | 20300 | 0.6947 |
253
- | 0.6622 | 283.3333 | 20400 | 0.6844 |
254
- | 0.6618 | 284.7222 | 20500 | 0.6868 |
255
- | 0.6674 | 286.1111 | 20600 | 0.6933 |
256
- | 0.6567 | 287.5 | 20700 | 0.6893 |
257
- | 0.6593 | 288.8889 | 20800 | 0.6868 |
258
- | 0.6613 | 290.2778 | 20900 | 0.6828 |
259
- | 0.6635 | 291.6667 | 21000 | 0.6707 |
260
- | 0.6523 | 293.0556 | 21100 | 0.6829 |
261
- | 0.6566 | 294.4444 | 21200 | 0.6748 |
262
- | 0.6513 | 295.8333 | 21300 | 0.6787 |
263
- | 0.6539 | 297.2222 | 21400 | 0.6762 |
264
- | 0.6436 | 298.6111 | 21500 | 0.6711 |
265
- | 0.6433 | 300.0 | 21600 | 0.6742 |
266
- | 0.6443 | 301.3889 | 21700 | 0.6656 |
267
- | 0.6354 | 302.7778 | 21800 | 0.6677 |
268
- | 0.6465 | 304.1667 | 21900 | 0.6740 |
269
- | 0.6373 | 305.5556 | 22000 | 0.6732 |
270
- | 0.6363 | 306.9444 | 22100 | 0.6639 |
271
- | 0.6313 | 308.3333 | 22200 | 0.6699 |
272
- | 0.6318 | 309.7222 | 22300 | 0.6569 |
273
- | 0.6372 | 311.1111 | 22400 | 0.6557 |
274
- | 0.6333 | 312.5 | 22500 | 0.6539 |
275
- | 0.6307 | 313.8889 | 22600 | 0.6626 |
276
- | 0.6259 | 315.2778 | 22700 | 0.6710 |
277
- | 0.6288 | 316.6667 | 22800 | 0.6698 |
278
- | 0.6218 | 318.0556 | 22900 | 0.6599 |
279
- | 0.6305 | 319.4444 | 23000 | 0.6728 |
280
- | 0.6225 | 320.8333 | 23100 | 0.6600 |
281
- | 0.6227 | 322.2222 | 23200 | 0.6512 |
282
- | 0.624 | 323.6111 | 23300 | 0.6611 |
283
- | 0.6198 | 325.0 | 23400 | 0.6473 |
284
- | 0.622 | 326.3889 | 23500 | 0.6617 |
285
- | 0.6106 | 327.7778 | 23600 | 0.6658 |
286
- | 0.6183 | 329.1667 | 23700 | 0.6477 |
287
- | 0.6169 | 330.5556 | 23800 | 0.6394 |
288
- | 0.6157 | 331.9444 | 23900 | 0.6352 |
289
- | 0.614 | 333.3333 | 24000 | 0.6488 |
290
- | 0.6165 | 334.7222 | 24100 | 0.6331 |
291
- | 0.6111 | 336.1111 | 24200 | 0.6334 |
292
- | 0.6117 | 337.5 | 24300 | 0.6381 |
293
- | 0.6126 | 338.8889 | 24400 | 0.6349 |
294
- | 0.6026 | 340.2778 | 24500 | 0.6435 |
295
- | 0.6045 | 341.6667 | 24600 | 0.6470 |
296
- | 0.6021 | 343.0556 | 24700 | 0.6447 |
297
- | 0.6005 | 344.4444 | 24800 | 0.6343 |
298
- | 0.6012 | 345.8333 | 24900 | 0.6233 |
299
- | 0.5969 | 347.2222 | 25000 | 0.6348 |
300
- | 0.6008 | 348.6111 | 25100 | 0.6423 |
301
- | 0.5962 | 350.0 | 25200 | 0.6342 |
302
- | 0.5981 | 351.3889 | 25300 | 0.6258 |
303
- | 0.6001 | 352.7778 | 25400 | 0.6345 |
304
- | 0.6012 | 354.1667 | 25500 | 0.6331 |
305
- | 0.5912 | 355.5556 | 25600 | 0.6420 |
306
- | 0.585 | 356.9444 | 25700 | 0.6298 |
307
- | 0.5924 | 358.3333 | 25800 | 0.6444 |
308
- | 0.5875 | 359.7222 | 25900 | 0.6256 |
309
 
310
 
311
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [adalbertojunior/distilbert-portuguese-cased](https://huggingface.co/adalbertojunior/distilbert-portuguese-cased) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.6264
19
 
20
  ## Model description
21
 
 
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:--------:|:-----:|:---------------:|
50
+ | 6.8891 | 1.5385 | 100 | 5.5076 |
51
+ | 5.1289 | 3.0769 | 200 | 4.5650 |
52
+ | 4.444 | 4.6154 | 300 | 3.9873 |
53
+ | 3.9906 | 6.1538 | 400 | 3.6108 |
54
+ | 3.6562 | 7.6923 | 500 | 3.3357 |
55
+ | 3.406 | 9.2308 | 600 | 3.1277 |
56
+ | 3.2193 | 10.7692 | 700 | 2.9534 |
57
+ | 3.0559 | 12.3077 | 800 | 2.8168 |
58
+ | 2.9276 | 13.8462 | 900 | 2.6756 |
59
+ | 2.82 | 15.3846 | 1000 | 2.5928 |
60
+ | 2.7174 | 16.9231 | 1100 | 2.5070 |
61
+ | 2.6316 | 18.4615 | 1200 | 2.4184 |
62
+ | 2.5452 | 20.0 | 1300 | 2.3554 |
63
+ | 2.478 | 21.5385 | 1400 | 2.2848 |
64
+ | 2.4092 | 23.0769 | 1500 | 2.2292 |
65
+ | 2.3571 | 24.6154 | 1600 | 2.1836 |
66
+ | 2.287 | 26.1538 | 1700 | 2.1197 |
67
+ | 2.2508 | 27.6923 | 1800 | 2.0870 |
68
+ | 2.1999 | 29.2308 | 1900 | 2.0416 |
69
+ | 2.1476 | 30.7692 | 2000 | 2.0292 |
70
+ | 2.108 | 32.3077 | 2100 | 1.9542 |
71
+ | 2.0812 | 33.8462 | 2200 | 1.9063 |
72
+ | 2.0348 | 35.3846 | 2300 | 1.8793 |
73
+ | 1.9974 | 36.9231 | 2400 | 1.8498 |
74
+ | 1.9685 | 38.4615 | 2500 | 1.8201 |
75
+ | 1.9393 | 40.0 | 2600 | 1.7741 |
76
+ | 1.9009 | 41.5385 | 2700 | 1.7620 |
77
+ | 1.8734 | 43.0769 | 2800 | 1.7417 |
78
+ | 1.8492 | 44.6154 | 2900 | 1.7261 |
79
+ | 1.823 | 46.1538 | 3000 | 1.7029 |
80
+ | 1.7955 | 47.6923 | 3100 | 1.6882 |
81
+ | 1.7686 | 49.2308 | 3200 | 1.6587 |
82
+ | 1.7536 | 50.7692 | 3300 | 1.6312 |
83
+ | 1.7244 | 52.3077 | 3400 | 1.6180 |
84
+ | 1.7024 | 53.8462 | 3500 | 1.5936 |
85
+ | 1.687 | 55.3846 | 3600 | 1.5634 |
86
+ | 1.6653 | 56.9231 | 3700 | 1.5554 |
87
+ | 1.6368 | 58.4615 | 3800 | 1.5247 |
88
+ | 1.6047 | 60.0 | 3900 | 1.4862 |
89
+ | 1.5864 | 61.5385 | 4000 | 1.4758 |
90
+ | 1.5592 | 63.0769 | 4100 | 1.4692 |
91
+ | 1.5481 | 64.6154 | 4200 | 1.4586 |
92
+ | 1.5333 | 66.1538 | 4300 | 1.4331 |
93
+ | 1.5227 | 67.6923 | 4400 | 1.4034 |
94
+ | 1.4919 | 69.2308 | 4500 | 1.4211 |
95
+ | 1.473 | 70.7692 | 4600 | 1.4025 |
96
+ | 1.4709 | 72.3077 | 4700 | 1.3727 |
97
+ | 1.4491 | 73.8462 | 4800 | 1.3621 |
98
+ | 1.4393 | 75.3846 | 4900 | 1.3533 |
99
+ | 1.4272 | 76.9231 | 5000 | 1.3210 |
100
+ | 1.4101 | 78.4615 | 5100 | 1.2969 |
101
+ | 1.393 | 80.0 | 5200 | 1.3298 |
102
+ | 1.3806 | 81.5385 | 5300 | 1.3060 |
103
+ | 1.3642 | 83.0769 | 5400 | 1.2768 |
104
+ | 1.3553 | 84.6154 | 5500 | 1.2668 |
105
+ | 1.344 | 86.1538 | 5600 | 1.2862 |
106
+ | 1.3368 | 87.6923 | 5700 | 1.2459 |
107
+ | 1.3291 | 89.2308 | 5800 | 1.2634 |
108
+ | 1.3107 | 90.7692 | 5900 | 1.2446 |
109
+ | 1.2974 | 92.3077 | 6000 | 1.2335 |
110
+ | 1.2824 | 93.8462 | 6100 | 1.2124 |
111
+ | 1.2752 | 95.3846 | 6200 | 1.2028 |
112
+ | 1.2713 | 96.9231 | 6300 | 1.1853 |
113
+ | 1.2636 | 98.4615 | 6400 | 1.1745 |
114
+ | 1.2541 | 100.0 | 6500 | 1.1671 |
115
+ | 1.2402 | 101.5385 | 6600 | 1.1789 |
116
+ | 1.2274 | 103.0769 | 6700 | 1.1636 |
117
+ | 1.209 | 104.6154 | 6800 | 1.1557 |
118
+ | 1.2107 | 106.1538 | 6900 | 1.1427 |
119
+ | 1.195 | 107.6923 | 7000 | 1.1452 |
120
+ | 1.1924 | 109.2308 | 7100 | 1.1183 |
121
+ | 1.173 | 110.7692 | 7200 | 1.1141 |
122
+ | 1.1747 | 112.3077 | 7300 | 1.1151 |
123
+ | 1.1589 | 113.8462 | 7400 | 1.0888 |
124
+ | 1.1472 | 115.3846 | 7500 | 1.1126 |
125
+ | 1.1413 | 116.9231 | 7600 | 1.0757 |
126
+ | 1.1389 | 118.4615 | 7700 | 1.0827 |
127
+ | 1.1363 | 120.0 | 7800 | 1.0719 |
128
+ | 1.125 | 121.5385 | 7900 | 1.0649 |
129
+ | 1.1192 | 123.0769 | 8000 | 1.0733 |
130
+ | 1.1039 | 124.6154 | 8100 | 1.0639 |
131
+ | 1.0987 | 126.1538 | 8200 | 1.0635 |
132
+ | 1.0895 | 127.6923 | 8300 | 1.0471 |
133
+ | 1.0843 | 129.2308 | 8400 | 1.0366 |
134
+ | 1.0791 | 130.7692 | 8500 | 1.0172 |
135
+ | 1.0736 | 132.3077 | 8600 | 1.0156 |
136
+ | 1.0716 | 133.8462 | 8700 | 1.0227 |
137
+ | 1.0617 | 135.3846 | 8800 | 1.0112 |
138
+ | 1.0605 | 136.9231 | 8900 | 1.0059 |
139
+ | 1.0505 | 138.4615 | 9000 | 1.0161 |
140
+ | 1.0402 | 140.0 | 9100 | 0.9990 |
141
+ | 1.027 | 141.5385 | 9200 | 0.9798 |
142
+ | 1.027 | 143.0769 | 9300 | 0.9820 |
143
+ | 1.0226 | 144.6154 | 9400 | 0.9800 |
144
+ | 1.0149 | 146.1538 | 9500 | 0.9804 |
145
+ | 1.0099 | 147.6923 | 9600 | 0.9631 |
146
+ | 0.9991 | 149.2308 | 9700 | 0.9735 |
147
+ | 0.9974 | 150.7692 | 9800 | 0.9420 |
148
+ | 0.9927 | 152.3077 | 9900 | 0.9527 |
149
+ | 0.9864 | 153.8462 | 10000 | 0.9456 |
150
+ | 0.981 | 155.3846 | 10100 | 0.9499 |
151
+ | 0.9725 | 156.9231 | 10200 | 0.9353 |
152
+ | 0.9622 | 158.4615 | 10300 | 0.9460 |
153
+ | 0.9653 | 160.0 | 10400 | 0.9444 |
154
+ | 0.9595 | 161.5385 | 10500 | 0.9407 |
155
+ | 0.9516 | 163.0769 | 10600 | 0.9261 |
156
+ | 0.9468 | 164.6154 | 10700 | 0.9103 |
157
+ | 0.9434 | 166.1538 | 10800 | 0.9017 |
158
+ | 0.9413 | 167.6923 | 10900 | 0.9202 |
159
+ | 0.9349 | 169.2308 | 11000 | 0.8925 |
160
+ | 0.9274 | 170.7692 | 11100 | 0.9180 |
161
+ | 0.9213 | 172.3077 | 11200 | 0.9043 |
162
+ | 0.9161 | 173.8462 | 11300 | 0.8984 |
163
+ | 0.9156 | 175.3846 | 11400 | 0.8863 |
164
+ | 0.9133 | 176.9231 | 11500 | 0.8892 |
165
+ | 0.9003 | 178.4615 | 11600 | 0.8647 |
166
+ | 0.9062 | 180.0 | 11700 | 0.8806 |
167
+ | 0.896 | 181.5385 | 11800 | 0.8749 |
168
+ | 0.8881 | 183.0769 | 11900 | 0.8743 |
169
+ | 0.8844 | 184.6154 | 12000 | 0.8717 |
170
+ | 0.8789 | 186.1538 | 12100 | 0.8551 |
171
+ | 0.873 | 187.6923 | 12200 | 0.8599 |
172
+ | 0.8647 | 189.2308 | 12300 | 0.8566 |
173
+ | 0.861 | 190.7692 | 12400 | 0.8457 |
174
+ | 0.861 | 192.3077 | 12500 | 0.8435 |
175
+ | 0.8609 | 193.8462 | 12600 | 0.8392 |
176
+ | 0.8581 | 195.3846 | 12700 | 0.8539 |
177
+ | 0.8608 | 196.9231 | 12800 | 0.8345 |
178
+ | 0.8439 | 198.4615 | 12900 | 0.8455 |
179
+ | 0.8467 | 200.0 | 13000 | 0.8218 |
180
+ | 0.8365 | 201.5385 | 13100 | 0.8170 |
181
+ | 0.8435 | 203.0769 | 13200 | 0.8254 |
182
+ | 0.8334 | 204.6154 | 13300 | 0.8237 |
183
+ | 0.8289 | 206.1538 | 13400 | 0.8124 |
184
+ | 0.825 | 207.6923 | 13500 | 0.8114 |
185
+ | 0.8257 | 209.2308 | 13600 | 0.8207 |
186
+ | 0.8232 | 210.7692 | 13700 | 0.8101 |
187
+ | 0.8136 | 212.3077 | 13800 | 0.8038 |
188
+ | 0.8101 | 213.8462 | 13900 | 0.7883 |
189
+ | 0.8103 | 215.3846 | 14000 | 0.8154 |
190
+ | 0.8094 | 216.9231 | 14100 | 0.8136 |
191
+ | 0.8046 | 218.4615 | 14200 | 0.8022 |
192
+ | 0.7982 | 220.0 | 14300 | 0.7850 |
193
+ | 0.7964 | 221.5385 | 14400 | 0.7885 |
194
+ | 0.7908 | 223.0769 | 14500 | 0.7828 |
195
+ | 0.7903 | 224.6154 | 14600 | 0.7993 |
196
+ | 0.7843 | 226.1538 | 14700 | 0.7816 |
197
+ | 0.7838 | 227.6923 | 14800 | 0.7872 |
198
+ | 0.7744 | 229.2308 | 14900 | 0.7867 |
199
+ | 0.7801 | 230.7692 | 15000 | 0.7815 |
200
+ | 0.7778 | 232.3077 | 15100 | 0.7670 |
201
+ | 0.7703 | 233.8462 | 15200 | 0.7681 |
202
+ | 0.7666 | 235.3846 | 15300 | 0.7521 |
203
+ | 0.7734 | 236.9231 | 15400 | 0.7673 |
204
+ | 0.7636 | 238.4615 | 15500 | 0.7399 |
205
+ | 0.7598 | 240.0 | 15600 | 0.7542 |
206
+ | 0.7565 | 241.5385 | 15700 | 0.7613 |
207
+ | 0.7498 | 243.0769 | 15800 | 0.7659 |
208
+ | 0.7476 | 244.6154 | 15900 | 0.7528 |
209
+ | 0.7447 | 246.1538 | 16000 | 0.7601 |
210
+ | 0.7457 | 247.6923 | 16100 | 0.7525 |
211
+ | 0.742 | 249.2308 | 16200 | 0.7393 |
212
+ | 0.7437 | 250.7692 | 16300 | 0.7475 |
213
+ | 0.7337 | 252.3077 | 16400 | 0.7339 |
214
+ | 0.7343 | 253.8462 | 16500 | 0.7410 |
215
+ | 0.7288 | 255.3846 | 16600 | 0.7453 |
216
+ | 0.7317 | 256.9231 | 16700 | 0.7389 |
217
+ | 0.7317 | 258.4615 | 16800 | 0.7139 |
218
+ | 0.7279 | 260.0 | 16900 | 0.7321 |
219
+ | 0.722 | 261.5385 | 17000 | 0.7334 |
220
+ | 0.7222 | 263.0769 | 17100 | 0.7276 |
221
+ | 0.7158 | 264.6154 | 17200 | 0.7125 |
222
+ | 0.7139 | 266.1538 | 17300 | 0.7207 |
223
+ | 0.7098 | 267.6923 | 17400 | 0.7147 |
224
+ | 0.7146 | 269.2308 | 17500 | 0.7124 |
225
+ | 0.709 | 270.7692 | 17600 | 0.7079 |
226
+ | 0.7001 | 272.3077 | 17700 | 0.7132 |
227
+ | 0.708 | 273.8462 | 17800 | 0.7249 |
228
+ | 0.7038 | 275.3846 | 17900 | 0.7088 |
229
+ | 0.6955 | 276.9231 | 18000 | 0.7163 |
230
+ | 0.7016 | 278.4615 | 18100 | 0.7112 |
231
+ | 0.6931 | 280.0 | 18200 | 0.6875 |
232
+ | 0.6959 | 281.5385 | 18300 | 0.6892 |
233
+ | 0.6946 | 283.0769 | 18400 | 0.6945 |
234
+ | 0.6904 | 284.6154 | 18500 | 0.6970 |
235
+ | 0.6872 | 286.1538 | 18600 | 0.7030 |
236
+ | 0.6925 | 287.6923 | 18700 | 0.7118 |
237
+ | 0.6848 | 289.2308 | 18800 | 0.6986 |
238
+ | 0.6796 | 290.7692 | 18900 | 0.6994 |
239
+ | 0.6821 | 292.3077 | 19000 | 0.6834 |
240
+ | 0.6763 | 293.8462 | 19100 | 0.7022 |
241
+ | 0.6741 | 295.3846 | 19200 | 0.7019 |
242
+ | 0.6723 | 296.9231 | 19300 | 0.7042 |
243
+ | 0.6724 | 298.4615 | 19400 | 0.6991 |
244
+ | 0.6735 | 300.0 | 19500 | 0.6952 |
245
+ | 0.6693 | 301.5385 | 19600 | 0.6833 |
246
+ | 0.6666 | 303.0769 | 19700 | 0.6976 |
247
+ | 0.6637 | 304.6154 | 19800 | 0.6659 |
248
+ | 0.6641 | 306.1538 | 19900 | 0.6742 |
249
+ | 0.6653 | 307.6923 | 20000 | 0.6942 |
250
+ | 0.661 | 309.2308 | 20100 | 0.6820 |
251
+ | 0.6607 | 310.7692 | 20200 | 0.6888 |
252
+ | 0.6565 | 312.3077 | 20300 | 0.6740 |
253
+ | 0.6548 | 313.8462 | 20400 | 0.6540 |
254
+ | 0.658 | 315.3846 | 20500 | 0.6790 |
255
+ | 0.6509 | 316.9231 | 20600 | 0.6719 |
256
+ | 0.6491 | 318.4615 | 20700 | 0.6671 |
257
+ | 0.6438 | 320.0 | 20800 | 0.6709 |
258
+ | 0.6442 | 321.5385 | 20900 | 0.6751 |
259
+ | 0.6445 | 323.0769 | 21000 | 0.6498 |
260
+ | 0.6477 | 324.6154 | 21100 | 0.6621 |
261
+ | 0.6458 | 326.1538 | 21200 | 0.6503 |
262
+ | 0.6354 | 327.6923 | 21300 | 0.6694 |
263
+ | 0.6417 | 329.2308 | 21400 | 0.6531 |
264
+ | 0.6402 | 330.7692 | 21500 | 0.6483 |
265
+ | 0.6375 | 332.3077 | 21600 | 0.6528 |
266
+ | 0.6366 | 333.8462 | 21700 | 0.6583 |
267
+ | 0.6271 | 335.3846 | 21800 | 0.6398 |
268
+ | 0.64 | 336.9231 | 21900 | 0.6522 |
269
+ | 0.6241 | 338.4615 | 22000 | 0.6477 |
270
+ | 0.6277 | 340.0 | 22100 | 0.6437 |
271
+ | 0.6271 | 341.5385 | 22200 | 0.6310 |
272
+ | 0.6303 | 343.0769 | 22300 | 0.6377 |
273
+ | 0.625 | 344.6154 | 22400 | 0.6276 |
274
+ | 0.6229 | 346.1538 | 22500 | 0.6371 |
275
+ | 0.6215 | 347.6923 | 22600 | 0.6336 |
276
+ | 0.6224 | 349.2308 | 22700 | 0.6406 |
277
+ | 0.6234 | 350.7692 | 22800 | 0.6338 |
278
+ | 0.6239 | 352.3077 | 22900 | 0.6255 |
279
+ | 0.6182 | 353.8462 | 23000 | 0.6374 |
280
+ | 0.6186 | 355.3846 | 23100 | 0.6470 |
281
+ | 0.6148 | 356.9231 | 23200 | 0.6366 |
282
+ | 0.6051 | 358.4615 | 23300 | 0.6359 |
283
+ | 0.6093 | 360.0 | 23400 | 0.6462 |
284
+ | 0.6115 | 361.5385 | 23500 | 0.6270 |
285
+ | 0.6098 | 363.0769 | 23600 | 0.6064 |
286
+ | 0.6174 | 364.6154 | 23700 | 0.6314 |
287
+ | 0.6012 | 366.1538 | 23800 | 0.6352 |
288
+ | 0.6091 | 367.6923 | 23900 | 0.6266 |
289
+ | 0.6058 | 369.2308 | 24000 | 0.6359 |
290
+ | 0.612 | 370.7692 | 24100 | 0.6283 |
291
+ | 0.6126 | 372.3077 | 24200 | 0.6280 |
292
+ | 0.6075 | 373.8462 | 24300 | 0.6117 |
293
+ | 0.6053 | 375.3846 | 24400 | 0.6152 |
294
+ | 0.6 | 376.9231 | 24500 | 0.6267 |
295
+ | 0.6045 | 378.4615 | 24600 | 0.6252 |
 
296
 
297
 
298
  ### Framework versions
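
The two training-results tables above log a different epoch value at the same step, which suggests the number of optimizer steps per epoch changed between the two runs. The sketch below recovers that figure from the first logged row of each table and compares the two reported evaluation losses. Treating `exp(loss)` as a perplexity is an assumption: it only holds if the reported loss is a mean cross-entropy in nats, which the README does not state. All variable and function names are illustrative.

```python
import math

# First logged row of each table: (epoch, step). Values copied from the diff.
OLD_FIRST_ROW = (1.3889, 100)
NEW_FIRST_ROW = (1.5385, 100)

# Evaluation losses reported in the README header before and after the commit.
OLD_EVAL_LOSS = 0.6466
NEW_EVAL_LOSS = 0.6264


def steps_per_epoch(epoch: float, step: int) -> int:
    """Estimate optimizer steps per epoch from one (epoch, step) log entry."""
    return round(step / epoch)


old_spe = steps_per_epoch(*OLD_FIRST_ROW)
new_spe = steps_per_epoch(*NEW_FIRST_ROW)

# Negative delta means the new run reports a lower evaluation loss.
loss_delta = NEW_EVAL_LOSS - OLD_EVAL_LOSS

# Assumption: loss is mean cross-entropy in nats, so exp(loss) is a perplexity.
old_ppl = math.exp(OLD_EVAL_LOSS)
new_ppl = math.exp(NEW_EVAL_LOSS)

print(f"steps/epoch: old={old_spe}, new={new_spe}")
print(f"eval loss delta: {loss_delta:+.4f}")
print(f"exp(loss): old={old_ppl:.3f}, new={new_ppl:.3f}")
```

The estimates agree with later rows of each table: step 7200 is logged at epoch 100.0 in the old table (72 steps per epoch) and step 6500 at epoch 100.0 in the new one (65 steps per epoch). Why the count changed (dataset size, batch size, or something else) is not visible in this diff.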
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:55923558453cb2f52034ccdf3bc400251dedecb3040e40d90a2324540827550b
+ oid sha256:887c74711367a0b7fbe515ac6439e3572117020218ff96bcbd091e3eb7640c92
  size 265721304
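
The `model.safetensors` entry is a Git LFS pointer file, so the diff shows only the content hash changing while the byte size stays the same. As a minimal sketch, the pointer text can be parsed into its fields and the two versions compared. The helper name `parse_lfs_pointer` is hypothetical; the pointer contents are copied from the diff above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:55923558453cb2f52034ccdf3bc400251dedecb3040e40d90a2324540827550b
size 265721304
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:887c74711367a0b7fbe515ac6439e3572117020218ff96bcbd091e3eb7640c92
size 265721304
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

print("weights changed:", old["digest"] != new["digest"])
print("size unchanged:", old["size"] == new["size"])
```

An unchanged size with a new digest is what one would expect when weights are retrained on the same architecture, since the tensor shapes and dtypes stay fixed.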