AI & ML interests

Text Classification, Toxic Speech Detection, Hate Speech Detection, Multilingualism, Cross-lingual Classification Knowledge Transfer

Text Classification datasets and models for Ukrainian

We release datasets and models for Ukrainian covering several classification domains: toxicity, NLI, and formality.

📰 Toloka blog post on Toxicity Classification in Ukrainian

Corresponding papers

[2025] Daryna Dementieva, Nikolay Babakov, and Alexander Fraser. 2025. EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian. arXiv preprint arXiv:2505.23297. Part of the SemEval2025 Emotion Detection Shared Task.

[COLING2025] Daryna Dementieva, Valeriia Khylenko, and Georg Groh. 2025. Cross-lingual Text Classification Transfer: The Case of Ukrainian. In Proceedings of the 31st International Conference on Computational Linguistics, pages 1451–1464, Abu Dhabi, UAE. Association for Computational Linguistics.

[NAACL2024, WOAH] Daryna Dementieva, Valeriia Khylenko, Nikolay Babakov, and Georg Groh. 2024. Toxicity Classification in Ukrainian. In Proceedings of the 8th Workshop on Online Abuse and Harms (WOAH 2024), pages 244–255, Mexico City, Mexico. Association for Computational Linguistics.