pszemraj
/

long-t5-tglobal-xl-16384-book-summary

text2text-generation

Model card Files Files and versions Community

pszemraj commited on Nov 27, 2022

Commit

34f99d2

·

1 Parent(s): b04b6bd

Update README.md

Files changed (1) hide show

README.md +16 -36

README.md CHANGED Viewed

@@ -15,42 +15,7 @@ datasets:
 metrics:
 - rouge
 inference: false
-model-index:
-- name: pszemraj/long-t5-tglobal-xl-16384-book-summary
-  results:
-  - task:
-      type: summarization
-      name: Summarization
-    dataset:
-      name: kmfoda/booksum
-      type: kmfoda/booksum
-      config: kmfoda--booksum
-      split: test
-    metrics:
-    - name: ROUGE-1
-      type: rouge
-      value: 34.0139
-      verified: true
-    - name: ROUGE-2
-      type: rouge
-      value: 7.0901
-      verified: true
-    - name: ROUGE-L
-      type: rouge
-      value: 17.0925
-      verified: true
-    - name: ROUGE-LSUM
-      type: rouge
-      value: 31.6899
-      verified: true
-    - name: loss
-      type: loss
-      value: 2.096665382385254
-      verified: true
-    - name: gen_len
-      type: gen_len
-      value: 377.109
-      verified: true
 ---
 # long-t5-tglobal-xl + BookSum
@@ -136,6 +101,21 @@ Official results with the [model evaluator](https://huggingface.co/spaces/autoev
 - eval_samples_per_second: 0.107
 - eval_steps_per_second: 0.027
 ---
 ## FAQ

 metrics:
 - rouge
 inference: false
 ---
 # long-t5-tglobal-xl + BookSum
 - eval_samples_per_second: 0.107
 - eval_steps_per_second: 0.027
+```
+***** predict/test metrics (initial) *****
+  predict_gen_len            =   506.4368
+  predict_loss               =      2.028
+  predict_rouge1             =    36.8815
+  predict_rouge2             =     8.0625
+  predict_rougeL             =    17.6161
+  predict_rougeLsum          =    34.9068
+  predict_runtime            = 2:04:14.37
+  predict_samples            =       1431
+  predict_samples_per_second =      0.192
+  predict_steps_per_second   =      0.048
+```
+\* evaluating big model not as easy as it seems. Doing a bit more investigating
 ---
 ## FAQ