IndicXTREME Collection IndicXTREME is a human-supervised benchmark of 9 diverse NLU tasks across 20 languages, featuring 105 evaluation sets in total. β’ 8 items β’ Updated Oct 23, 2024 β’ 1
Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance Paper β’ 2310.14572 β’ Published Oct 23, 2023 β’ 1
Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance Paper β’ 2310.14572 β’ Published Oct 23, 2023 β’ 1
Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation Paper β’ 2503.15222 β’ Published Mar 19 β’ 1
Airavata Evaluation Suite Collection A collection of benchmarks used for evaluation of Airavata, an Hindi instruction-tuned model on top of Sarvam's OpenHathi base model. β’ 22 items β’ Updated Oct 15, 2024 β’ 8