add model

Files changed (4) hide show

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 2.8314
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +30,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the xsum dataset.
 It achieves the following results on the evaluation set:
 - Loss: nan
-- Rouge1: 2.8314
-- Rouge2: 0.3142
-- Rougel: 2.6475
-- Rougelsum: 2.6485
 - Gen Len: 4.9416
 ## Model description
@@ -54,8 +54,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -64,9 +64,9 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| nan           | 1.0   | 6377 | nan             | 2.8314 | 0.3142 | 2.6475 | 2.6485    | 4.9416  |
 ### Framework versions

     metrics:
     - name: Rouge1
       type: rouge
+      value: 2.8351
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the xsum dataset.
 It achieves the following results on the evaluation set:
 - Loss: nan
+- Rouge1: 2.8351
+- Rouge2: 0.3143
+- Rougel: 2.6488
+- Rougelsum: 2.6463
 - Gen Len: 4.9416
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| nan           | 1.0   | 12753 | nan             | 2.8351 | 0.3143 | 2.6488 | 2.6463    | 4.9416  |
 ### Framework versions

runs/Sep21_12-04-25_cadff6dd71cd/1632226261.0087423/events.out.tfevents.1632226261.cadff6dd71cd.90.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:70397e2e242259734c3f26b1d10ee53d54ca66cdb9e63faff0adf9934515334e
+size 4450

runs/Sep21_12-04-25_cadff6dd71cd/events.out.tfevents.1632226260.cadff6dd71cd.90.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5ba4136f5b139d0be1de2821aa4837d4383243e42a39d83fa524e71bf4487996
+size 7915

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d2c4fc316d12e82918e897fb14271d4f2cf16d2fd193ddb5f481ea2e7082a6f0
 size 2799

 version https://git-lfs.github.com/spec/v1
+oid sha256:804275309efbc57096b328641731daaa2c05b66f35a20796e77d6de5c5ff371f
 size 2799