End of training
Browse files- README.md +23 -23
- model.safetensors +1 -1
README.md
CHANGED
@@ -1,25 +1,25 @@
|
|
1 |
---
|
2 |
-
base_model: IAmSkyDra/BARTBana_Before
|
3 |
library_name: transformers
|
4 |
license: mit
|
5 |
-
|
6 |
-
- sacrebleu
|
7 |
tags:
|
8 |
- generated_from_trainer
|
|
|
|
|
9 |
model-index:
|
10 |
-
- name:
|
11 |
results: []
|
12 |
---
|
13 |
|
14 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
15 |
should probably proofread and complete it, then remove this comment. -->
|
16 |
|
17 |
-
#
|
18 |
|
19 |
-
This model is a fine-tuned version of [IAmSkyDra/
|
20 |
It achieves the following results on the evaluation set:
|
21 |
-
- Loss: 0.
|
22 |
-
- Sacrebleu: 11.
|
23 |
|
24 |
## Model description
|
25 |
|
@@ -51,21 +51,21 @@ The following hyperparameters were used during training:
|
|
51 |
|
52 |
| Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
|
53 |
|:-------------:|:-----:|:-----:|:---------------:|:---------:|
|
54 |
-
| 0.
|
55 |
-
| 0.
|
56 |
-
| 0.
|
57 |
-
| 0.
|
58 |
-
| 0.
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.
|
62 |
-
| 0.
|
63 |
-
| 0.
|
64 |
-
| 0.
|
65 |
-
| 0.
|
66 |
-
| 0.
|
67 |
-
| 0.
|
68 |
-
| 0.
|
69 |
|
70 |
|
71 |
### Framework versions
|
|
|
1 |
---
|
|
|
2 |
library_name: transformers
|
3 |
license: mit
|
4 |
+
base_model: IAmSkyDra/BARTBana
|
|
|
5 |
tags:
|
6 |
- generated_from_trainer
|
7 |
+
metrics:
|
8 |
+
- sacrebleu
|
9 |
model-index:
|
10 |
+
- name: BARTBana_Translation_v2
|
11 |
results: []
|
12 |
---
|
13 |
|
14 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
15 |
should probably proofread and complete it, then remove this comment. -->
|
16 |
|
17 |
+
# BARTBana_Translation_v2
|
18 |
|
19 |
+
This model is a fine-tuned version of [IAmSkyDra/BARTBana](https://huggingface.co/IAmSkyDra/BARTBana) on the None dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
+
- Loss: 0.4520
|
22 |
+
- Sacrebleu: 11.7352
|
23 |
|
24 |
## Model description
|
25 |
|
|
|
51 |
|
52 |
| Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
|
53 |
|:-------------:|:-----:|:-----:|:---------------:|:---------:|
|
54 |
+
| 0.695 | 1.0 | 742 | 0.6021 | 6.3321 |
|
55 |
+
| 0.5976 | 2.0 | 1484 | 0.5291 | 8.6429 |
|
56 |
+
| 0.5171 | 3.0 | 2226 | 0.4958 | 9.7101 |
|
57 |
+
| 0.4919 | 4.0 | 2968 | 0.4781 | 10.3323 |
|
58 |
+
| 0.4556 | 5.0 | 3710 | 0.4680 | 10.7812 |
|
59 |
+
| 0.4387 | 6.0 | 4452 | 0.4577 | 10.8965 |
|
60 |
+
| 0.4095 | 7.0 | 5194 | 0.4538 | 11.1963 |
|
61 |
+
| 0.3924 | 8.0 | 5936 | 0.4499 | 11.2119 |
|
62 |
+
| 0.3815 | 9.0 | 6678 | 0.4486 | 11.4155 |
|
63 |
+
| 0.3647 | 10.0 | 7420 | 0.4468 | 11.4443 |
|
64 |
+
| 0.3525 | 11.0 | 8162 | 0.4479 | 11.5941 |
|
65 |
+
| 0.3435 | 12.0 | 8904 | 0.4489 | 11.5933 |
|
66 |
+
| 0.3349 | 13.0 | 9646 | 0.4500 | 11.7211 |
|
67 |
+
| 0.3289 | 14.0 | 10388 | 0.4508 | 11.7113 |
|
68 |
+
| 0.3202 | 15.0 | 11130 | 0.4520 | 11.7352 |
|
69 |
|
70 |
|
71 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1583480280
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3580d12a16812e84bdc9a77fa40bac38321fdecd0329f8972f6f6dcd2da54433
|
3 |
size 1583480280
|