PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language
Paper • 2505.10055
NLP, Computer Vision, Low-resource languages NLP, Pashto, OCR, Text-to-Speech, Speech-to-Text