JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation Paper • 2410.17250 • Published Oct 22, 2024 • 15
evborjnvioerjnvuowsetngboetgjbeigjaweuofjf/i-love-anime-sakuga Viewer • Updated Jan 7 • 1.49M • 468 • 19
What matters when building vision-language models? Paper • 2405.02246 • Published May 3, 2024 • 102