End of training
- README.md +262 -39
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
15 |
|
16 |
This model is a fine-tuned version of [adalbertojunior/distilbert-portuguese-cased](https://huggingface.co/adalbertojunior/distilbert-portuguese-cased) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
-
- Loss:
|
19 |
|
20 |
## Model description
|
21 |
|
@@ -45,44 +45,267 @@ The following hyperparameters were used during training:
|
|
45 |
|
46 |
### Training results
|
47 |
|
48 |
-
| Training Loss | Epoch
|
49 |
-
|
50 |
-
| 6.
|
51 |
-
| 5.
|
52 |
-
| 4.
|
53 |
-
|
|
54 |
-
| 3.
|
55 |
-
| 3.
|
56 |
-
| 3.
|
57 |
-
| 3.
|
58 |
-
| 2.
|
59 |
-
| 2.
|
60 |
-
| 2.
|
61 |
-
| 2.
|
62 |
-
| 2.
|
63 |
-
| 2.
|
64 |
-
| 2.
|
65 |
-
| 2.
|
66 |
-
| 2.
|
67 |
-
| 2.
|
68 |
-
| 2.
|
69 |
-
| 2.
|
70 |
-
| 2.
|
71 |
-
| 2.
|
72 |
-
| 2.
|
73 |
-
| 2.
|
74 |
-
| 1.
|
75 |
-
| 1.
|
76 |
-
| 1.
|
77 |
-
| 1.
|
78 |
-
| 1.
|
79 |
-
| 1.
|
80 |
-
| 1.
|
81 |
-
| 1.
|
82 |
-
| 1.
|
83 |
-
| 1.
|
84 |
-
| 1.
|
85 |
-
| 1.
|
86 |
|
87 |
|
88 |
### Framework versions
|
|
|
15 |
|
16 |
This model is a fine-tuned version of [adalbertojunior/distilbert-portuguese-cased](https://huggingface.co/adalbertojunior/distilbert-portuguese-cased) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
+
- Loss: 0.6466
|
19 |
|
20 |
## Model description
|
21 |
|
|
|
45 |
|
46 |
### Training results
|
47 |
|
48 |
+
| Training Loss | Epoch | Step | Validation Loss |
|
49 |
+
|:-------------:|:--------:|:-----:|:---------------:|
|
50 |
+
| 6.8123 | 1.3889 | 100 | 5.5177 |
|
51 |
+
| 5.1647 | 2.7778 | 200 | 4.6195 |
|
52 |
+
| 4.4717 | 4.1667 | 300 | 4.0395 |
|
53 |
+
| 4.0232 | 5.5556 | 400 | 3.6607 |
|
54 |
+
| 3.6917 | 6.9444 | 500 | 3.3826 |
|
55 |
+
| 3.4525 | 8.3333 | 600 | 3.1628 |
|
56 |
+
| 3.2549 | 9.7222 | 700 | 3.0003 |
|
57 |
+
| 3.0811 | 11.1111 | 800 | 2.8633 |
|
58 |
+
| 2.959 | 12.5 | 900 | 2.7506 |
|
59 |
+
| 2.8471 | 13.8889 | 1000 | 2.6297 |
|
60 |
+
| 2.7321 | 15.2778 | 1100 | 2.5441 |
|
61 |
+
| 2.6444 | 16.6667 | 1200 | 2.4690 |
|
62 |
+
| 2.5641 | 18.0556 | 1300 | 2.3772 |
|
63 |
+
| 2.4889 | 19.4444 | 1400 | 2.3022 |
|
64 |
+
| 2.4214 | 20.8333 | 1500 | 2.2521 |
|
65 |
+
| 2.3677 | 22.2222 | 1600 | 2.2045 |
|
66 |
+
| 2.3108 | 23.6111 | 1700 | 2.1531 |
|
67 |
+
| 2.2519 | 25.0 | 1800 | 2.1167 |
|
68 |
+
| 2.2159 | 26.3889 | 1900 | 2.0711 |
|
69 |
+
| 2.1751 | 27.7778 | 2000 | 2.0200 |
|
70 |
+
| 2.1338 | 29.1667 | 2100 | 1.9792 |
|
71 |
+
| 2.092 | 30.5556 | 2200 | 1.9560 |
|
72 |
+
| 2.0469 | 31.9444 | 2300 | 1.9302 |
|
73 |
+
| 2.0119 | 33.3333 | 2400 | 1.8737 |
|
74 |
+
| 1.9751 | 34.7222 | 2500 | 1.8639 |
|
75 |
+
| 1.9557 | 36.1111 | 2600 | 1.8357 |
|
76 |
+
| 1.9265 | 37.5 | 2700 | 1.8006 |
|
77 |
+
| 1.8883 | 38.8889 | 2800 | 1.7937 |
|
78 |
+
| 1.862 | 40.2778 | 2900 | 1.7344 |
|
79 |
+
| 1.8457 | 41.6667 | 3000 | 1.7238 |
|
80 |
+
| 1.811 | 43.0556 | 3100 | 1.7025 |
|
81 |
+
| 1.7889 | 44.4444 | 3200 | 1.6837 |
|
82 |
+
| 1.7656 | 45.8333 | 3300 | 1.6712 |
|
83 |
+
| 1.7372 | 47.2222 | 3400 | 1.6261 |
|
84 |
+
| 1.7189 | 48.6111 | 3500 | 1.6136 |
|
85 |
+
| 1.6957 | 50.0 | 3600 | 1.6015 |
|
86 |
+
| 1.6774 | 51.3889 | 3700 | 1.5803 |
|
87 |
+
| 1.6551 | 52.7778 | 3800 | 1.5728 |
|
88 |
+
| 1.638 | 54.1667 | 3900 | 1.5398 |
|
89 |
+
| 1.6161 | 55.5556 | 4000 | 1.5423 |
|
90 |
+
| 1.5986 | 56.9444 | 4100 | 1.5037 |
|
91 |
+
| 1.5852 | 58.3333 | 4200 | 1.4801 |
|
92 |
+
| 1.5718 | 59.7222 | 4300 | 1.4826 |
|
93 |
+
| 1.5483 | 61.1111 | 4400 | 1.4776 |
|
94 |
+
| 1.5326 | 62.5 | 4500 | 1.4548 |
|
95 |
+
| 1.5228 | 63.8889 | 4600 | 1.4442 |
|
96 |
+
| 1.4965 | 65.2778 | 4700 | 1.4031 |
|
97 |
+
| 1.4702 | 66.6667 | 4800 | 1.3834 |
|
98 |
+
| 1.4603 | 68.0556 | 4900 | 1.3778 |
|
99 |
+
| 1.441 | 69.4444 | 5000 | 1.3707 |
|
100 |
+
| 1.4263 | 70.8333 | 5100 | 1.3522 |
|
101 |
+
| 1.4136 | 72.2222 | 5200 | 1.3273 |
|
102 |
+
| 1.399 | 73.6111 | 5300 | 1.3429 |
|
103 |
+
| 1.3844 | 75.0 | 5400 | 1.3061 |
|
104 |
+
| 1.3724 | 76.3889 | 5500 | 1.3003 |
|
105 |
+
| 1.3596 | 77.7778 | 5600 | 1.2754 |
|
106 |
+
| 1.3488 | 79.1667 | 5700 | 1.2679 |
|
107 |
+
| 1.3414 | 80.5556 | 5800 | 1.2614 |
|
108 |
+
| 1.3335 | 81.9444 | 5900 | 1.2568 |
|
109 |
+
| 1.3165 | 83.3333 | 6000 | 1.2440 |
|
110 |
+
| 1.3078 | 84.7222 | 6100 | 1.2387 |
|
111 |
+
| 1.2914 | 86.1111 | 6200 | 1.2341 |
|
112 |
+
| 1.2867 | 87.5 | 6300 | 1.2264 |
|
113 |
+
| 1.2758 | 88.8889 | 6400 | 1.2150 |
|
114 |
+
| 1.2709 | 90.2778 | 6500 | 1.2056 |
|
115 |
+
| 1.257 | 91.6667 | 6600 | 1.2121 |
|
116 |
+
| 1.2455 | 93.0556 | 6700 | 1.1860 |
|
117 |
+
| 1.2354 | 94.4444 | 6800 | 1.1787 |
|
118 |
+
| 1.2298 | 95.8333 | 6900 | 1.1604 |
|
119 |
+
| 1.2202 | 97.2222 | 7000 | 1.1632 |
|
120 |
+
| 1.2045 | 98.6111 | 7100 | 1.1477 |
|
121 |
+
| 1.2062 | 100.0 | 7200 | 1.1484 |
|
122 |
+
| 1.2039 | 101.3889 | 7300 | 1.1493 |
|
123 |
+
| 1.1851 | 102.7778 | 7400 | 1.1298 |
|
124 |
+
| 1.1806 | 104.1667 | 7500 | 1.1277 |
|
125 |
+
| 1.1616 | 105.5556 | 7600 | 1.1080 |
|
126 |
+
| 1.1614 | 106.9444 | 7700 | 1.1081 |
|
127 |
+
| 1.1504 | 108.3333 | 7800 | 1.1334 |
|
128 |
+
| 1.1407 | 109.7222 | 7900 | 1.1024 |
|
129 |
+
| 1.1318 | 111.1111 | 8000 | 1.0949 |
|
130 |
+
| 1.1258 | 112.5 | 8100 | 1.0917 |
|
131 |
+
| 1.1212 | 113.8889 | 8200 | 1.0718 |
|
132 |
+
| 1.119 | 115.2778 | 8300 | 1.0893 |
|
133 |
+
| 1.102 | 116.6667 | 8400 | 1.0606 |
|
134 |
+
| 1.091 | 118.0556 | 8500 | 1.0709 |
|
135 |
+
| 1.0834 | 119.4444 | 8600 | 1.0493 |
|
136 |
+
| 1.0964 | 120.8333 | 8700 | 1.0448 |
|
137 |
+
| 1.0775 | 122.2222 | 8800 | 1.0432 |
|
138 |
+
| 1.076 | 123.6111 | 8900 | 1.0309 |
|
139 |
+
| 1.0602 | 125.0 | 9000 | 1.0191 |
|
140 |
+
| 1.0583 | 126.3889 | 9100 | 1.0346 |
|
141 |
+
| 1.052 | 127.7778 | 9200 | 1.0326 |
|
142 |
+
| 1.0416 | 129.1667 | 9300 | 1.0146 |
|
143 |
+
| 1.0404 | 130.5556 | 9400 | 1.0035 |
|
144 |
+
| 1.0254 | 131.9444 | 9500 | 1.0022 |
|
145 |
+
| 1.0302 | 133.3333 | 9600 | 1.0067 |
|
146 |
+
| 1.0219 | 134.7222 | 9700 | 1.0029 |
|
147 |
+
| 1.0171 | 136.1111 | 9800 | 0.9713 |
|
148 |
+
| 1.0043 | 137.5 | 9900 | 0.9969 |
|
149 |
+
| 1.0014 | 138.8889 | 10000 | 0.9847 |
|
150 |
+
| 0.9972 | 140.2778 | 10100 | 0.9827 |
|
151 |
+
| 0.9969 | 141.6667 | 10200 | 0.9771 |
|
152 |
+
| 0.9848 | 143.0556 | 10300 | 0.9696 |
|
153 |
+
| 0.9851 | 144.4444 | 10400 | 0.9619 |
|
154 |
+
| 0.9735 | 145.8333 | 10500 | 0.9598 |
|
155 |
+
| 0.9652 | 147.2222 | 10600 | 0.9435 |
|
156 |
+
| 0.9669 | 148.6111 | 10700 | 0.9475 |
|
157 |
+
| 0.9594 | 150.0 | 10800 | 0.9416 |
|
158 |
+
| 0.9584 | 151.3889 | 10900 | 0.9433 |
|
159 |
+
| 0.9486 | 152.7778 | 11000 | 0.9389 |
|
160 |
+
| 0.9456 | 154.1667 | 11100 | 0.9329 |
|
161 |
+
| 0.9399 | 155.5556 | 11200 | 0.9354 |
|
162 |
+
| 0.9265 | 156.9444 | 11300 | 0.9146 |
|
163 |
+
| 0.9269 | 158.3333 | 11400 | 0.9213 |
|
164 |
+
| 0.9333 | 159.7222 | 11500 | 0.9171 |
|
165 |
+
| 0.9222 | 161.1111 | 11600 | 0.9276 |
|
166 |
+
| 0.9171 | 162.5 | 11700 | 0.9104 |
|
167 |
+
| 0.9153 | 163.8889 | 11800 | 0.9081 |
|
168 |
+
| 0.9018 | 165.2778 | 11900 | 0.9064 |
|
169 |
+
| 0.9097 | 166.6667 | 12000 | 0.8837 |
|
170 |
+
| 0.8998 | 168.0556 | 12100 | 0.8802 |
|
171 |
+
| 0.8904 | 169.4444 | 12200 | 0.8866 |
|
172 |
+
| 0.8876 | 170.8333 | 12300 | 0.8672 |
|
173 |
+
| 0.8893 | 172.2222 | 12400 | 0.8894 |
|
174 |
+
| 0.8816 | 173.6111 | 12500 | 0.8660 |
|
175 |
+
| 0.88 | 175.0 | 12600 | 0.8911 |
|
176 |
+
| 0.8767 | 176.3889 | 12700 | 0.8532 |
|
177 |
+
| 0.8651 | 177.7778 | 12800 | 0.8675 |
|
178 |
+
| 0.8625 | 179.1667 | 12900 | 0.8567 |
|
179 |
+
| 0.8574 | 180.5556 | 13000 | 0.8608 |
|
180 |
+
| 0.8591 | 181.9444 | 13100 | 0.8706 |
|
181 |
+
| 0.8526 | 183.3333 | 13200 | 0.8568 |
|
182 |
+
| 0.8492 | 184.7222 | 13300 | 0.8423 |
|
183 |
+
| 0.8481 | 186.1111 | 13400 | 0.8570 |
|
184 |
+
| 0.8452 | 187.5 | 13500 | 0.8302 |
|
185 |
+
| 0.841 | 188.8889 | 13600 | 0.8306 |
|
186 |
+
| 0.8429 | 190.2778 | 13700 | 0.8372 |
|
187 |
+
| 0.83 | 191.6667 | 13800 | 0.8337 |
|
188 |
+
| 0.8356 | 193.0556 | 13900 | 0.8261 |
|
189 |
+
| 0.8318 | 194.4444 | 14000 | 0.8363 |
|
190 |
+
| 0.8218 | 195.8333 | 14100 | 0.8136 |
|
191 |
+
| 0.82 | 197.2222 | 14200 | 0.8140 |
|
192 |
+
| 0.8111 | 198.6111 | 14300 | 0.8330 |
|
193 |
+
| 0.8128 | 200.0 | 14400 | 0.8203 |
|
194 |
+
| 0.8082 | 201.3889 | 14500 | 0.8001 |
|
195 |
+
| 0.8071 | 202.7778 | 14600 | 0.8090 |
|
196 |
+
| 0.8033 | 204.1667 | 14700 | 0.8148 |
|
197 |
+
| 0.7964 | 205.5556 | 14800 | 0.7944 |
|
198 |
+
| 0.7965 | 206.9444 | 14900 | 0.8101 |
|
199 |
+
| 0.7936 | 208.3333 | 15000 | 0.7992 |
|
200 |
+
| 0.7838 | 209.7222 | 15100 | 0.8061 |
|
201 |
+
| 0.7834 | 211.1111 | 15200 | 0.7989 |
|
202 |
+
| 0.7829 | 212.5 | 15300 | 0.7893 |
|
203 |
+
| 0.7779 | 213.8889 | 15400 | 0.8032 |
|
204 |
+
| 0.7761 | 215.2778 | 15500 | 0.7841 |
|
205 |
+
| 0.7776 | 216.6667 | 15600 | 0.7834 |
|
206 |
+
| 0.7743 | 218.0556 | 15700 | 0.7865 |
|
207 |
+
| 0.7696 | 219.4444 | 15800 | 0.7808 |
|
208 |
+
| 0.7702 | 220.8333 | 15900 | 0.7761 |
|
209 |
+
| 0.7608 | 222.2222 | 16000 | 0.7916 |
|
210 |
+
| 0.7571 | 223.6111 | 16100 | 0.7580 |
|
211 |
+
| 0.7569 | 225.0 | 16200 | 0.7800 |
|
212 |
+
| 0.7495 | 226.3889 | 16300 | 0.7717 |
|
213 |
+
| 0.7554 | 227.7778 | 16400 | 0.7718 |
|
214 |
+
| 0.7455 | 229.1667 | 16500 | 0.7549 |
|
215 |
+
| 0.7476 | 230.5556 | 16600 | 0.7609 |
|
216 |
+
| 0.7477 | 231.9444 | 16700 | 0.7813 |
|
217 |
+
| 0.7495 | 233.3333 | 16800 | 0.7411 |
|
218 |
+
| 0.7328 | 234.7222 | 16900 | 0.7550 |
|
219 |
+
| 0.7363 | 236.1111 | 17000 | 0.7476 |
|
220 |
+
| 0.732 | 237.5 | 17100 | 0.7501 |
|
221 |
+
| 0.7353 | 238.8889 | 17200 | 0.7566 |
|
222 |
+
| 0.7294 | 240.2778 | 17300 | 0.7464 |
|
223 |
+
| 0.7231 | 241.6667 | 17400 | 0.7455 |
|
224 |
+
| 0.7227 | 243.0556 | 17500 | 0.7385 |
|
225 |
+
| 0.7225 | 244.4444 | 17600 | 0.7269 |
|
226 |
+
| 0.7166 | 245.8333 | 17700 | 0.7340 |
|
227 |
+
| 0.7147 | 247.2222 | 17800 | 0.7361 |
|
228 |
+
| 0.7158 | 248.6111 | 17900 | 0.7351 |
|
229 |
+
| 0.7163 | 250.0 | 18000 | 0.7336 |
|
230 |
+
| 0.7112 | 251.3889 | 18100 | 0.7418 |
|
231 |
+
| 0.7073 | 252.7778 | 18200 | 0.7328 |
|
232 |
+
| 0.7067 | 254.1667 | 18300 | 0.7345 |
|
233 |
+
| 0.7094 | 255.5556 | 18400 | 0.7278 |
|
234 |
+
| 0.7047 | 256.9444 | 18500 | 0.7147 |
|
235 |
+
| 0.7006 | 258.3333 | 18600 | 0.7229 |
|
236 |
+
| 0.6921 | 259.7222 | 18700 | 0.7239 |
|
237 |
+
| 0.6998 | 261.1111 | 18800 | 0.7226 |
|
238 |
+
| 0.6939 | 262.5 | 18900 | 0.7211 |
|
239 |
+
| 0.6934 | 263.8889 | 19000 | 0.7052 |
|
240 |
+
| 0.6868 | 265.2778 | 19100 | 0.7150 |
|
241 |
+
| 0.6799 | 266.6667 | 19200 | 0.7285 |
|
242 |
+
| 0.6835 | 268.0556 | 19300 | 0.7128 |
|
243 |
+
| 0.6865 | 269.4444 | 19400 | 0.7006 |
|
244 |
+
| 0.688 | 270.8333 | 19500 | 0.7135 |
|
245 |
+
| 0.6798 | 272.2222 | 19600 | 0.6953 |
|
246 |
+
| 0.6746 | 273.6111 | 19700 | 0.7109 |
|
247 |
+
| 0.6783 | 275.0 | 19800 | 0.7154 |
|
248 |
+
| 0.6732 | 276.3889 | 19900 | 0.7115 |
|
249 |
+
| 0.6715 | 277.7778 | 20000 | 0.6976 |
|
250 |
+
| 0.6702 | 279.1667 | 20100 | 0.6889 |
|
251 |
+
| 0.6699 | 280.5556 | 20200 | 0.6835 |
|
252 |
+
| 0.6663 | 281.9444 | 20300 | 0.6947 |
|
253 |
+
| 0.6622 | 283.3333 | 20400 | 0.6844 |
|
254 |
+
| 0.6618 | 284.7222 | 20500 | 0.6868 |
|
255 |
+
| 0.6674 | 286.1111 | 20600 | 0.6933 |
|
256 |
+
| 0.6567 | 287.5 | 20700 | 0.6893 |
|
257 |
+
| 0.6593 | 288.8889 | 20800 | 0.6868 |
|
258 |
+
| 0.6613 | 290.2778 | 20900 | 0.6828 |
|
259 |
+
| 0.6635 | 291.6667 | 21000 | 0.6707 |
|
260 |
+
| 0.6523 | 293.0556 | 21100 | 0.6829 |
|
261 |
+
| 0.6566 | 294.4444 | 21200 | 0.6748 |
|
262 |
+
| 0.6513 | 295.8333 | 21300 | 0.6787 |
|
263 |
+
| 0.6539 | 297.2222 | 21400 | 0.6762 |
|
264 |
+
| 0.6436 | 298.6111 | 21500 | 0.6711 |
|
265 |
+
| 0.6433 | 300.0 | 21600 | 0.6742 |
|
266 |
+
| 0.6443 | 301.3889 | 21700 | 0.6656 |
|
267 |
+
| 0.6354 | 302.7778 | 21800 | 0.6677 |
|
268 |
+
| 0.6465 | 304.1667 | 21900 | 0.6740 |
|
269 |
+
| 0.6373 | 305.5556 | 22000 | 0.6732 |
|
270 |
+
| 0.6363 | 306.9444 | 22100 | 0.6639 |
|
271 |
+
| 0.6313 | 308.3333 | 22200 | 0.6699 |
|
272 |
+
| 0.6318 | 309.7222 | 22300 | 0.6569 |
|
273 |
+
| 0.6372 | 311.1111 | 22400 | 0.6557 |
|
274 |
+
| 0.6333 | 312.5 | 22500 | 0.6539 |
|
275 |
+
| 0.6307 | 313.8889 | 22600 | 0.6626 |
|
276 |
+
| 0.6259 | 315.2778 | 22700 | 0.6710 |
|
277 |
+
| 0.6288 | 316.6667 | 22800 | 0.6698 |
|
278 |
+
| 0.6218 | 318.0556 | 22900 | 0.6599 |
|
279 |
+
| 0.6305 | 319.4444 | 23000 | 0.6728 |
|
280 |
+
| 0.6225 | 320.8333 | 23100 | 0.6600 |
|
281 |
+
| 0.6227 | 322.2222 | 23200 | 0.6512 |
|
282 |
+
| 0.624 | 323.6111 | 23300 | 0.6611 |
|
283 |
+
| 0.6198 | 325.0 | 23400 | 0.6473 |
|
284 |
+
| 0.622 | 326.3889 | 23500 | 0.6617 |
|
285 |
+
| 0.6106 | 327.7778 | 23600 | 0.6658 |
|
286 |
+
| 0.6183 | 329.1667 | 23700 | 0.6477 |
|
287 |
+
| 0.6169 | 330.5556 | 23800 | 0.6394 |
|
288 |
+
| 0.6157 | 331.9444 | 23900 | 0.6352 |
|
289 |
+
| 0.614 | 333.3333 | 24000 | 0.6488 |
|
290 |
+
| 0.6165 | 334.7222 | 24100 | 0.6331 |
|
291 |
+
| 0.6111 | 336.1111 | 24200 | 0.6334 |
|
292 |
+
| 0.6117 | 337.5 | 24300 | 0.6381 |
|
293 |
+
| 0.6126 | 338.8889 | 24400 | 0.6349 |
|
294 |
+
| 0.6026 | 340.2778 | 24500 | 0.6435 |
|
295 |
+
| 0.6045 | 341.6667 | 24600 | 0.6470 |
|
296 |
+
| 0.6021 | 343.0556 | 24700 | 0.6447 |
|
297 |
+
| 0.6005 | 344.4444 | 24800 | 0.6343 |
|
298 |
+
| 0.6012 | 345.8333 | 24900 | 0.6233 |
|
299 |
+
| 0.5969 | 347.2222 | 25000 | 0.6348 |
|
300 |
+
| 0.6008 | 348.6111 | 25100 | 0.6423 |
|
301 |
+
| 0.5962 | 350.0 | 25200 | 0.6342 |
|
302 |
+
| 0.5981 | 351.3889 | 25300 | 0.6258 |
|
303 |
+
| 0.6001 | 352.7778 | 25400 | 0.6345 |
|
304 |
+
| 0.6012 | 354.1667 | 25500 | 0.6331 |
|
305 |
+
| 0.5912 | 355.5556 | 25600 | 0.6420 |
|
306 |
+
| 0.585 | 356.9444 | 25700 | 0.6298 |
|
307 |
+
| 0.5924 | 358.3333 | 25800 | 0.6444 |
|
308 |
+
| 0.5875 | 359.7222 | 25900 | 0.6256 |
|
309 |
|
310 |
|
311 |
### Framework versions
|
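The Epoch and Step columns in the added table move in lockstep, and the losses can be read as perplexities if they are mean cross-entropy values in nats. The sketch below assumes that interpretation, since the model card does not state the training objective. It uses three rows copied from the table plus the reported evaluation loss. `steps_per_epoch` and `perplexity` are hypothetical helper names, not part of any library.

```python
import math

# (epoch, step, validation_loss) rows copied from the training results table.
ROWS = [
    (1.3889, 100, 5.5177),
    (100.0, 7200, 1.1484),
    (359.7222, 25900, 0.6256),
]

# Evaluation loss reported at the top of the updated model card.
REPORTED_EVAL_LOSS = 0.6466


def steps_per_epoch(epoch: float, step: int) -> int:
    """Estimate optimizer steps per epoch from one logged row."""
    return round(step / epoch)


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


if __name__ == "__main__":
    for epoch, step, val_loss in ROWS:
        print(step, steps_per_epoch(epoch, step), round(perplexity(val_loss), 3))
    print("reported:", round(perplexity(REPORTED_EVAL_LOSS), 3))
```

If every row gives the same steps-per-epoch estimate, the logging interval of 100 steps is the only reason the Epoch column shows fractional values.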
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 265721304
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:55923558453cb2f52034ccdf3bc400251dedecb3040e40d90a2324540827550b
|
3 |
size 265721304
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5240
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5cec22cb33f3d5dde1728317865abc088ff5009de1e0d46c22dbe1160d8a2067
|
3 |
size 5240
|
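Both binary files are stored as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object ID, and the size in bytes. As a minimal sketch, a downloaded file could be checked against its pointer as follows. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the pointer text is the new-side pointer for `training_args.bin` copied from the diff above.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if the bytes have the size and SHA-256 digest recorded in the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError("this sketch only handles sha256 object IDs")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# New-side pointer for training_args.bin, copied from the diff.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:5cec22cb33f3d5dde1728317865abc088ff5009de1e0d46c22dbe1160d8a2067
size 5240
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

For a small file such as `training_args.bin`, reading the whole file into memory and passing the bytes to `matches_pointer` is enough. For `model.safetensors` (265721304 bytes), hashing in chunks with `hashlib.sha256().update(...)` would avoid holding the whole file in memory.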