Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate • Paper • 2410.07167 • Published Oct 9, 2024 • 40
Open LLM Leaderboard • Running on CPU Upgrade • 13.2k • Track, rank and evaluate open LLMs and chatbots
Vision Arena (Testing VLMs side-by-side) • Running • 549 • Analyze images to detect and label objects