HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs Paper • 2503.02003 • Published 9 days ago • 42
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models Paper • 2502.09696 • Published 27 days ago • 39
Vision Arena (Testing VLMs side-by-side) • Analyze images to detect and label objects • Running • 543
VideoGameBunny: Towards vision assistants for video games Paper • 2407.15295 • Published Jul 21, 2024 • 22