Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI Paper • 2401.14019 • Published Jan 25 • 20
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants Paper • 2308.16884 • Published Aug 31, 2023 • 8
Genie: Achieving Human Parity in Content-Grounded Datasets Generation Paper • 2401.14367 • Published Jan 25 • 7