IAmSkyDra
/

BARTBana_Translation_Finetune_v0

Text2Text Generation

Transformers

Safetensors

mbart

Generated from Trainer

Model card Files Files and versions Community

IAmSkyDra commited on Jan 27

Commit

d3be9d7

verified ·

1 Parent(s): b72861d

End of training

Browse files

Files changed (2) hide show

README.md +20 -15
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 library_name: transformers
 license: mit
-base_model: vinai/bartpho-syllable
 tags:
 - generated_from_trainer
 metrics:
@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # BARTBana_Translation_Finetune_v0
-This model is a fine-tuned version of [vinai/bartpho-syllable](https://huggingface.co/vinai/bartpho-syllable) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.4266
-- Sacrebleu: 2.9607
 ## Model description
@@ -44,23 +44,28 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|
-| 0.8315        | 1.0   | 429  | 0.6994          | 1.5712    |
-| 0.7319        | 2.0   | 858  | 0.6374          | 2.6693    |
-| 0.6676        | 3.0   | 1287 | 0.5882          | 4.0478    |
-| 0.6199        | 4.0   | 1716 | 0.5549          | 4.4905    |
-| 0.5912        | 5.0   | 2145 | 0.5353          | 5.0681    |
-| 0.5583        | 6.0   | 2574 | 0.5219          | 5.7212    |
-| 0.5488        | 7.0   | 3003 | 0.5117          | 6.1119    |
-| 0.5294        | 8.0   | 3432 | 0.5052          | 5.9770    |
-| 0.5227        | 9.0   | 3861 | 0.5020          | 6.2340    |
-| 0.5113        | 10.0  | 4290 | 0.5011          | 6.2681    |
 ### Framework versions

 ---
 library_name: transformers
 license: mit
+base_model: IAmSkyDra/BARTBana_v4
 tags:
 - generated_from_trainer
 metrics:
 # BARTBana_Translation_Finetune_v0
+This model is a fine-tuned version of [IAmSkyDra/BARTBana_v4](https://huggingface.co/IAmSkyDra/BARTBana_v4) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4295
+- Sacrebleu: 7.5050
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 15
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|
+| 0.6831        | 1.0   | 468  | 0.5818          | 3.2556    |
+| 0.566         | 2.0   | 936  | 0.5188          | 4.6548    |
+| 0.5127        | 3.0   | 1404 | 0.4878          | 5.3508    |
+| 0.4804        | 4.0   | 1872 | 0.4683          | 5.8657    |
+| 0.4558        | 5.0   | 2340 | 0.4551          | 6.2975    |
+| 0.433         | 6.0   | 2808 | 0.4450          | 6.4311    |
+| 0.4146        | 7.0   | 3276 | 0.4420          | 6.7296    |
+| 0.3969        | 8.0   | 3744 | 0.4365          | 6.9791    |
+| 0.3911        | 9.0   | 4212 | 0.4332          | 7.1487    |
+| 0.3742        | 10.0  | 4680 | 0.4302          | 7.2803    |
+| 0.3686        | 11.0  | 5148 | 0.4292          | 7.3851    |
+| 0.3568        | 12.0  | 5616 | 0.4296          | 7.4003    |
+| 0.3505        | 13.0  | 6084 | 0.4292          | 7.4202    |
+| 0.3503        | 14.0  | 6552 | 0.4289          | 7.4984    |
+| 0.3453        | 15.0  | 7020 | 0.4295          | 7.5050    |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d59785d3bca7939612a296bc8c9204f8a23b65632fd4ef5e8f22840c12c67a52
 size 1583480280

 version https://git-lfs.github.com/spec/v1
+oid sha256:57073ab2e681328d8b8118b2bfcfaa6f9b35312bc03e8167305a55df3a2036e1
 size 1583480280