Pamzyy
/

sinhala_gpt2

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pamzyy commited on Sep 2, 2024

Commit

a69908f

·

verified ·

1 Parent(s): 75c3251

Model save

Files changed (1) hide show

README.md +1 -13

README.md CHANGED Viewed

@@ -15,8 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
 # sinhala_gpt2
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.6139
 ## Model description
@@ -44,20 +42,10 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 60
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss |
-|:-------------:|:-------:|:----:|:---------------:|
-| 3.3922        | 7.3529  | 500  | 0.9294          |
-| 0.862         | 14.7059 | 1000 | 0.7480          |
-| 0.7413        | 22.0588 | 1500 | 0.6813          |
-| 0.6795        | 29.4118 | 2000 | 0.6468          |
-| 0.642         | 36.7647 | 2500 | 0.6274          |
-| 0.6187        | 44.1176 | 3000 | 0.6192          |
-| 0.607         | 51.4706 | 3500 | 0.6149          |
-| 0.602         | 58.8235 | 4000 | 0.6139          |
 ### Framework versions

 # sinhala_gpt2
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 5
 ### Training results
 ### Framework versions