tabbas97
/

distilbert-base-uncased-finetuned-pubmed-lora-trained-tabbas97

Generated from Trainer

Model card Files Files and versions Community

tabbas97 commited on May 18, 2024

Commit

7949fef

·

verified ·

1 Parent(s): 4c8ed3d

End of training

Files changed (1) hide show

README.md +22 -14

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the pubmed-summarization dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9835
 ## Model description
@@ -38,7 +38,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
@@ -48,18 +48,26 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 2.2356        | 0.4   | 500  | 2.0414          |
-| 2.161         | 0.8   | 1000 | 2.0307          |
-| 2.1446        | 1.2   | 1500 | 1.9946          |
-| 2.143         | 1.6   | 2000 | 2.0254          |
-| 2.1318        | 2.0   | 2500 | 1.9951          |
-| 2.133         | 2.4   | 3000 | 2.0143          |
-| 2.1321        | 2.8   | 3500 | 1.9991          |
-| 2.1268        | 3.2   | 4000 | 1.9789          |
-| 2.1169        | 3.6   | 4500 | 1.9736          |
-| 2.1254        | 4.0   | 5000 | 1.9745          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the pubmed-summarization dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.9256
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 32
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 2.1986        | 0.1667 | 500  | 2.0156          |
+| 2.1414        | 0.3334 | 1000 | 1.9893          |
+| 2.1247        | 0.5002 | 1500 | 1.9770          |
+| 2.1106        | 0.6669 | 2000 | 1.9640          |
+| 2.103         | 0.8336 | 2500 | 1.9548          |
+| 2.0974        | 1.0003 | 3000 | 1.9519          |
+| 2.0874        | 1.1671 | 3500 | 1.9506          |
+| 2.0842        | 1.3338 | 4000 | 1.9470          |
+| 2.0799        | 1.5005 | 4500 | 1.9406          |
+| 2.0781        | 1.6672 | 5000 | 1.9363          |
+| 2.0763        | 1.8339 | 5500 | 1.9371          |
+| 2.0664        | 2.0007 | 6000 | 1.9311          |
+| 2.0717        | 2.1674 | 6500 | 1.9277          |
+| 2.0683        | 2.3341 | 7000 | 1.9247          |
+| 2.0622        | 2.5008 | 7500 | 1.9290          |
+| 2.0614        | 2.6676 | 8000 | 1.9170          |
+| 2.0614        | 2.8343 | 8500 | 1.9239          |
+| 2.0646        | 3.0010 | 9000 | 1.9211          |
 ### Framework versions