VLM Performance - a takara-ai Collection

Updated Jul 10, 2024

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024

Note: Use the BlindTest Eval benchmark to evaluate vision language models on visual tasks that are easy for humans.