alif-munim commited on
Commit
be21fad
·
1 Parent(s): cf4cf2e

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -1
README.md CHANGED
@@ -11,7 +11,9 @@ should probably proofread and complete it, then remove this comment. -->
11
 
12
  # distilgpt2
13
 
14
- This model was trained from scratch on an unknown dataset.
 
 
15
 
16
  ## Model description
17
 
@@ -38,6 +40,15 @@ The following hyperparameters were used during training:
38
  - lr_scheduler_type: linear
39
  - num_epochs: 3.0
40
 
 
 
 
 
 
 
 
 
 
41
  ### Framework versions
42
 
43
  - Transformers 4.31.0
 
11
 
12
  # distilgpt2
13
 
14
+ This model was trained from scratch on the None dataset.
15
+ It achieves the following results on the evaluation set:
16
+ - Loss: 3.7487
17
 
18
  ## Model description
19
 
 
40
  - lr_scheduler_type: linear
41
  - num_epochs: 3.0
42
 
43
+ ### Training results
44
+
45
+ | Training Loss | Epoch | Step | Validation Loss |
46
+ |:-------------:|:-----:|:----:|:---------------:|
47
+ | No log | 1.0 | 282 | 3.7686 |
48
+ | 3.8868 | 2.0 | 564 | 3.7522 |
49
+ | 3.8868 | 3.0 | 846 | 3.7487 |
50
+
51
+
52
  ### Framework versions
53
 
54
  - Transformers 4.31.0