Update README.md
Browse files
README.md
CHANGED
@@ -44,11 +44,22 @@ tokenizer = RobertaTokenizer.from_pretrained('dsfsi/PuoBERTa')
|
|
44 |
|
45 |
### Downstream Use
|
46 |
|
47 |
-
|
48 |
## Dataset
|
49 |
|
50 |
We used the PuoData dataset, a rich source of Setswana text, ensuring that our model is well-trained and culturally attuned.\\
|
51 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
52 |
## Contributing
|
53 |
|
54 |
Your contributions are welcome! Feel free to improve the model.
|
|
|
44 |
|
45 |
### Downstream Use
|
46 |
|
|
|
47 |
## Dataset
|
48 |
|
49 |
We used the PuoData dataset, a rich source of Setswana text, ensuring that our model is well-trained and culturally attuned.\\
|
50 |
|
51 |
+
### Citation Information
|
52 |
+
|
53 |
+
Bibtex Refrence
|
54 |
+
|
55 |
+
```
|
56 |
+
@article{marivatePuoBERTa2023,
|
57 |
+
title={PuoBERTa: Training and evaluation of a curated language model for Setswana},
|
58 |
+
author={Vukosi Marivate and Moseli Mots'Oehli and Valencia Wagner and Richard Lastrucci and Isheanesu Dzingirai},
|
59 |
+
journal={ArXiv},
|
60 |
+
}
|
61 |
+
```
|
62 |
+
|
63 |
## Contributing
|
64 |
|
65 |
Your contributions are welcome! Feel free to improve the model.
|