olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 8 days ago • 101
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 23 days ago • 68
Global Exams: Bangladesh (Localized MMLU) [ICLR'25] Collection Exams dataset in Bangladesh (Bengali, English) • 4 items • Updated 21 days ago • 1
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated 25 days ago • 25
Global Multimodal Exams: Bangladesh (Localized MMMU) Collection Vision-language Exams dataset in Bangladesh (Bengali, English) • 7 items • Updated Feb 21
Retrieval-Augmented Generation [EMNLP'24] Collection Artifacts for "Open-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models" [EMNLP 2024 Findings] • 5 items • Updated Feb 20 • 2