Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining Paper • 2404.05428 • Published Apr 8, 2024
The ParlaSent-BCS dataset of sentiment-annotated parliamentary debates from Bosnia-Herzegovina, Croatia, and Serbia Paper • 2206.00929 • Published Jun 2, 2022
The ParlaSent multilingual training dataset for sentiment identification in parliamentary proceedings Paper • 2309.09783 • Published Sep 18, 2023