MVL-SIB: A Massively Multilingual Vision-Language Benchmark for Cross-Modal Topical Matching Paper • 2502.12852 • Published 19 days ago • 3
GIMMICK -- Globally Inclusive Multimodal Multitask Cultural Knowledge Benchmarking Paper • 2502.13766 • Published 18 days ago • 3
Why do LLaVA Vision-Language Models Reply to Images in English? Paper • 2407.02333 • Published Jul 2, 2024
M5 -- A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks Paper • 2407.03791 • Published Jul 4, 2024 • 1
Multilingual and Explainable Text Detoxification with Parallel Corpora Paper • 2412.11691 • Published Dec 16, 2024 • 1
Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model Paper • 2501.05122 • Published Jan 9, 2025 • 20