Commit 3d6e239
Parent(s): 8b84700
Update README.md
README.md CHANGED

@@ -84,8 +84,8 @@ The following hyperparameters were used during training:
 | 3.69 | 0.97 | 3000 | 2.8778 | 0.1662 | 0.0237 | 0.1647 | 0.4694 |

 The mT5 model of google cannot be used for Korean although it is trained over 101 languages. Finetuning
-using very large data set
-Since GPU memories allowed for free use
+using a very large dataset such as bongsoo/news_talk_en_ko still yields garbage.
+Since the GPU memory allowed for free use in Colab is greatly limited, repeated fine-tunings on the split datasets are performed
 to obtain better results. Theoretically, this might give better results. But actual attempts fail to yield
 better results. Instead, the results become worse. One should use other
 models like the ke-t5 by KETI (Korea Electronics Technology Institute).
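For readers who want to see what the "repeated fine-tunings on the split datasets" workflow from the added lines could look like, here is a minimal sketch. It is not the script behind this commit: the shard count, hyperparameters, output paths, and the dataset column names ("en", "ko") are assumptions; only the dataset id bongsoo/news_talk_en_ko and the general idea of shard-by-shard fine-tuning under Colab memory limits come from the README text.

```python
# Hedged sketch: shard-by-shard fine-tuning of mT5 under tight GPU memory.
# Column names ("en", "ko"), paths, and hyperparameters are illustrative assumptions.
from datasets import load_dataset
from transformers import (
    AutoTokenizer,
    AutoModelForSeq2SeqLM,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

NUM_SHARDS = 10                       # assumed: small enough that one shard fits a free Colab GPU
checkpoint = "google/mt5-small"       # the first shard starts from the base mT5 checkpoint

tokenizer = AutoTokenizer.from_pretrained(checkpoint)

def preprocess(batch):
    # "en"/"ko" column names are an assumption about the dataset schema.
    inputs = tokenizer(batch["en"], max_length=128, truncation=True)
    labels = tokenizer(text_target=batch["ko"], max_length=128, truncation=True)
    inputs["labels"] = labels["input_ids"]
    return inputs

train = load_dataset("bongsoo/news_talk_en_ko", split="train")

for i in range(NUM_SHARDS):
    shard = train.shard(num_shards=NUM_SHARDS, index=i)
    tokenized = shard.map(preprocess, batched=True, remove_columns=shard.column_names)

    # Reload the model from the checkpoint produced by the previous shard,
    # so each run only ever trains on one shard.
    model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

    args = Seq2SeqTrainingArguments(
        output_dir=f"mt5-ko-shard-{i}",
        per_device_train_batch_size=4,    # small batch + accumulation to fit limited GPU memory
        gradient_accumulation_steps=8,
        num_train_epochs=1,
        fp16=True,                        # assumes a CUDA GPU, as in Colab
        save_strategy="epoch",
        logging_steps=500,
    )
    trainer = Seq2SeqTrainer(
        model=model,
        args=args,
        train_dataset=tokenized,
        data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    )
    trainer.train()
    trainer.save_model(f"mt5-ko-shard-{i}")
    checkpoint = f"mt5-ko-shard-{i}"      # the next shard resumes from this model
```

The same loop could be pointed at a Korean-focused checkpoint such as KETI-AIR/ke-t5-base (model id assumed) instead of google/mt5-small, which is the switch the README ultimately recommends.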