pszemraj
/

tFINE-base-300m-samsum

text2text-generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on Aug 14

Commit

5a95ac7

•

1 Parent(s): 4b6c14e

Update README.md

Files changed (1) hide show

README.md +8 -22

README.md CHANGED Viewed

@@ -25,14 +25,13 @@ model-index:
     - name: Rouge1
       type: rouge
       value: 42.3629
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # tFINE-base-300m-samsum
-This model is a fine-tuned version of [pszemraj/tFINE-base-300m](https://huggingface.co/pszemraj/tFINE-base-300m) on the samsum dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.9820
 - Rouge1: 42.3629
@@ -41,17 +40,8 @@ It achieves the following results on the evaluation set:
 - Rougelsum: 38.7792
 - Gen Len: 27.8033
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -71,6 +61,9 @@ The following hyperparameters were used during training:
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:------:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
 | 1.9528        | 0.9989 | 115  | 1.9189          | 40.093  | 18.2018 | 33.9749 | 36.9071   | 29.3333 |
@@ -78,10 +71,3 @@ The following hyperparameters were used during training:
 | 1.1696        | 2.9967 | 345  | 1.9820          | 42.3629 | 18.4285 | 34.6339 | 38.7792   | 27.8033 |
 | 0.9359        | 3.9957 | 460  | 2.1588          | 41.2237 | 17.8161 | 33.7101 | 37.9569   | 30.18   |
-### Framework versions
-- Transformers 4.44.0
-- Pytorch 2.2.0+cu121
-- Datasets 2.20.0
-- Tokenizers 0.19.1

     - name: Rouge1
       type: rouge
       value: 42.3629
+library_name: transformers
+pipeline_tag: summarization
 ---
 # tFINE-base-300m-samsum
+An example fine-tune of [pszemraj/tFINE-base-300m](https://hf.co/pszemraj/tFINE-base-300m) for summarization using the samsum dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.9820
 - Rouge1: 42.3629
 - Rougelsum: 38.7792
 - Gen Len: 27.8033
+> [!NOTE]
+> The base model was pre-trained with CTX 1024 and fine-tuned on samsum with 1024 CTX inputs.
 ## Training procedure
 ### Training results
+> keep epoch 3 checkpt as final
 | Training Loss | Epoch  | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:------:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
 | 1.9528        | 0.9989 | 115  | 1.9189          | 40.093  | 18.2018 | 33.9749 | 36.9071   | 29.3333 |
 | 1.1696        | 2.9967 | 345  | 1.9820          | 42.3629 | 18.4285 | 34.6339 | 38.7792   | 27.8033 |
 | 0.9359        | 3.9957 | 460  | 2.1588          | 41.2237 | 17.8161 | 33.7101 | 37.9569   | 30.18   |