rubenroy
/

GPT2-GCv2-100k


rubenroy committed on Jul 22

Commit

d0f1ab0

·

verified ·

1 Parent(s): 050d7d3

Update README.md

Files changed (1)

README.md +15 -3

@@ -1,3 +1,15 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+datasets:
+- rubenroy/GammaCorpus-v2-100k
+language:
+- en
+tags:
+- gammacorpus
+---
+# GPT2 GammaCorpus v2 100k
+This is a GPT-2 language model fine-tuned on the GammaCorpus v2 - 100k dataset, which consists of 100,000 structured user-assistant conversation pairs. The model was initialised from the pretrained `gpt2` weights and trained for 2 epochs with a maximum sequence length of 256 tokens, a batch size of 2 (with gradient accumulation), and a learning rate of 5e-5.
+The tokenizer is the original GPT-2 tokenizer, with the EOS token reused as the pad token. The training objective was causal language modeling.
+Link to training dataset: https://huggingface.co/datasets/rubenroy/GammaCorpus-v2-100k
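
The README says the EOS token doubles as the pad token and that training used a causal language modeling objective with a maximum sequence length of 256. The following is a minimal pure-Python sketch of what that setup can look like for a single example. It is not the author's training code. The function name `pad_for_causal_lm`, the choice to mask padded positions with `-100` (the default `ignore_index` of PyTorch's cross-entropy loss), and the sample token ids are assumptions made for illustration. The value `50256` is the id of `<|endoftext|>` in the original GPT-2 vocabulary.

```python
GPT2_EOS_ID = 50256   # id of <|endoftext|> in the original GPT-2 vocabulary
MAX_LEN = 256         # maximum sequence length stated in the README
IGNORE_INDEX = -100   # assumption: padded labels are excluded from the loss


def pad_for_causal_lm(token_ids, max_len=MAX_LEN, pad_id=GPT2_EOS_ID):
    """Hypothetical helper: truncate or right-pad one tokenized example.

    Returns input ids, an attention mask, and labels. Labels are a copy of
    the input ids (the one-position shift is typically applied inside the
    model), with padded positions set to IGNORE_INDEX.
    """
    ids = list(token_ids)[:max_len]          # truncate to the maximum length
    n_real = len(ids)
    n_pad = max_len - n_real

    input_ids = ids + [pad_id] * n_pad       # pad with the EOS id
    attention_mask = [1] * n_real + [0] * n_pad
    # Mask by position, not by token id, so a genuine EOS at the end of the
    # text still contributes to the loss even though pad_id == EOS id.
    labels = ids + [IGNORE_INDEX] * n_pad

    return {
        "input_ids": input_ids,
        "attention_mask": attention_mask,
        "labels": labels,
    }


# Arbitrary ids standing in for a tokenized conversation that ends with EOS.
example = pad_for_causal_lm([15496, 11, 995, GPT2_EOS_ID])
print(len(example["input_ids"]), sum(example["attention_mask"]))
```

Masking by position instead of by token id is a deliberate choice in this sketch: because the pad id and the EOS id are identical, masking every `50256` label would also hide the real end-of-text token from the loss.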