giantkylin/my_eli5_clm-model

giantkylin committed on Oct 26, 2023

Commit 0429d08 · 1 Parent(s): aaa57b2

End of training

Files changed (2)

README.md +6 -6
pytorch_model.bin +1 -1

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.4561
+- Loss: 3.7619
 ## Model description
@@ -44,11 +44,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step   | Validation Loss |
-|:-------------:|:-----:|:------:|:---------------:|
-| 3.6           | 1.0   | 37821  | 3.5046          |
-| 3.5554        | 2.0   | 75642  | 3.4672          |
-| 3.5097        | 3.0   | 113463 | 3.4561          |
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 3.8641        | 1.0   | 1103 | 3.7760          |
+| 3.7704        | 2.0   | 2206 | 3.7635          |
+| 3.7314        | 3.0   | 3309 | 3.7619          |
 ### Framework versions
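
For a causal language model, a validation loss like the ones in the training-results table is usually the mean per-token cross-entropy in nats, in which case perplexity is simply `exp(loss)`. The following is a minimal sketch under that assumption; the dictionary and function names are illustrative and not part of this commit.

```python
import math

# Validation losses copied from the removed (old) and added (new) table rows.
# Assumption: each value is a mean per-token cross-entropy in nats.
old_val_loss = {1.0: 3.5046, 2.0: 3.4672, 3.0: 3.4561}
new_val_loss = {1.0: 3.7760, 2.0: 3.7635, 3.0: 3.7619}


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (in nats) to perplexity."""
    return math.exp(loss)


# Print old and new perplexity side by side for each epoch.
for epoch in sorted(new_val_loss):
    old_ppl = perplexity(old_val_loss[epoch])
    new_ppl = perplexity(new_val_loss[epoch])
    print(f"epoch {epoch:.0f}: old ppl {old_ppl:.2f}, new ppl {new_ppl:.2f}")
```

Because the two runs took very different numbers of steps per epoch (37821 versus 1103), the figures compare two separate training runs rather than two checkpoints of the same run.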

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:06b5a491755e5c25224133411b48333c69c1d7685f885fa06b644e168b147167
+oid sha256:87afc9a94c4c72b038468de579f961da22935ea2f695809c61bb27b1761b8cae
 size 327675282
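
In a Git LFS pointer, the `oid sha256:` field is the SHA-256 digest of the real file's contents and `size` is its length in bytes, so a downloaded `pytorch_model.bin` can be checked against the pointer. The following is a minimal sketch, assuming the file is already available locally; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of Git LFS or any Hub client.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash both match the pointer."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Read in chunks so a large weights file is never held fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents as they appear on the added side of this commit.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:87afc9a94c4c72b038468de579f961da22935ea2f695809c61bb27b1761b8cae
size 327675282
"""

pointer = parse_lfs_pointer(NEW_POINTER)
# Hypothetical local path; uncomment once the file has been downloaded.
# print(matches_pointer("pytorch_model.bin", pointer))
```

The size is compared first because it is cheap and rules out truncated downloads before any hashing is done.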