Mir-2002
/

codet5p-google-style-docstrings

Model card Files Files and versions Community

Mir-2002 commited on Jun 25

Commit

3b0299d

·

verified ·

1 Parent(s): 9d5aa9c

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -96,6 +96,8 @@ WEIGHT_DECAY = 0.01 <br>
 OPTIMIZER = ADAFACTOR <br>
 LR_SCHEDULER = LINEAR <br>
 # Loss
 On the 35th epoch, the model achieved the following loss:

 OPTIMIZER = ADAFACTOR <br>
 LR_SCHEDULER = LINEAR <br>
+The model was trained on via Colab Pro, on an L4 GPU. A gradient accumulation step of 4 was used to simulate an effective batch size of 64 (16 * 4).
 # Loss
 On the 35th epoch, the model achieved the following loss: