Ukrainian Texts Classification

non-profit

AI & ML interests

Text Classification, Toxic Speech Detection, Hate Speech Detection, Multilingualism, Cross-lingual Classification Knowledge Transfer

Recent Activity

Text Classification datasets and models for Uktainian

We release datasets and models for Ukrainian covering several classification domains: toxicity, NLI, and formality.

Corresponding papers

[COLING2025] Daryna Dementieva, Valeriia Khylenko, and Georg Groh. 2025. Cross-lingual Text Classification Transfer: The Case of Ukrainian. In Proceedings of the 31st International Conference on Computational Linguistics, pages 1451–1464, Abu Dhabi, UAE. Association for Computational Linguistics.

[NAACL2024, WOAH] Daryna Dementieva, Valeriia Khylenko, Nikolay Babakov, and Georg Groh. 2024. Toxicity Classification in Ukrainian. In Proceedings of the 8th Workshop on Online Abuse and Harms (WOAH 2024), pages 244–255, Mexico City, Mexico. Association for Computational Linguistics.