Multilingual OCR models
Nayana - Vision AI for all

Enabling Vision-Language Capabilities for Low-Resource Languages
Initiative by Cognitivelab
Problem Statement
Despite advancements in vision-language AI, a significant number of the world's languages remain underserved, leaving millions without tools to process documents in their native scripts.
Challenges Addressed by Nayana:
- Wide Language Gap: Lack of robust OCR solutions for a large spectrum of languages, particularly low-resource and rare languages.
- Script Complexity: Supporting diverse writing systems, including those with intricate scripts, cursive styles, or mixed-language content.
- Scalability: Need for adaptable models that can handle real-world multilingual document processing at scale.
Nayana is designed to tackle these challenges by fine-tuning cutting-edge OCR models for diverse languages across multiple regions, empowering users to extract actionable insights from their documents regardless of the language or script.
Vision
To democratize access to Vision-Language AI for all communities by empowering a wide range of languages, including low-resource and underrepresented ones, with cutting-edge OCR and document understanding capabilities.
Mission
- Enhance Accessibility: Build tools that enable equitable AI solutions for diverse linguistic groups worldwide.
- Expand Language Coverage: Support a vast range of languages and scripts, breaking barriers for multilingual document processing.
- Foster Collaboration: Provide an open-source platform where developers and researchers can enhance and expand multilingual OCR capabilities.
Models (19)

- Nayana-cognitivelab/Full-SFT-v1-23000 • Image-Text-to-Text • 8B • 9
- Nayana-cognitivelab/Full-SFT-v1-3500 • Image-Text-to-Text • 8B • 15
- Nayana-cognitivelab/Full-SFT-v1-3000 • Image-Text-to-Text • 8B • 15
- Nayana-cognitivelab/NayanaVQA • Image-Text-to-Text • 8B • 7
- Nayana-cognitivelab/NayanaSectionOCR • Image-Text-to-Text • 8B • 70
- Nayana-cognitivelab/DocOCR_SFT_v1_50 • Image-Text-to-Text • 8B • 18 • 1
- Nayana-cognitivelab/exp-colpali-merged-en-20k • 3B • 5
- Nayana-cognitivelab/exp-colpali-trained-en-20k-lora • 4
- Nayana-cognitivelab/exp-colpali-merged-hi-en-20k • 3B • 8
- Nayana-cognitivelab/exp-colpali-trained-hi-en-20k-lora • 5
Datasets (121)

- Nayana-cognitivelab/ViViD_arxiv • 95.4k • 22
- Nayana-cognitivelab/SectionOCR-SFT-augment • 226k • 57
- Nayana-cognitivelab/DocOCR-SFT-augment • 91.5k • 32
- Nayana-cognitivelab/DocOCR-SFT-v2 • 1k • 13
- Nayana-cognitivelab/DocOCR-SFT-v1 • 1k • 8
- Nayana-cognitivelab/SectionOCR-SFT-augment-archive • 226k • 27
- Nayana-cognitivelab/VQA-SFT • 557k • 60
- Nayana-cognitivelab/VQA-SFT-test • 377 • 122
- Nayana-cognitivelab/DocOCR-SFT • 229k • 63
- Nayana-cognitivelab/SectionOCR-SFT • 656k • 81