This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
AI & ML interests
Multilingual NLP, underserved languages
Organization Card
Open Language Data Initiative
Welcome!
The Open Language Data Initiative (OLDI) empowers language communities around the globe to contribute to the datasets that underpin today’s machine translation and natural language processing work. We invite community, academic, and industry members to contribute to key datasets that are essential to extending the reach of language technology.
For more information, visit oldi.org.