Post
2684
introducing: VLM vibe eval πͺ
visionLMsftw/VLMVibeEval
vision LMs are saturated over benchmarks, so we built vibe eval π¬
> compare different models with refreshed in-the-wild examples in different categories π€
> submit your favorite model for eval
no numbers -- just vibes!
vision LMs are saturated over benchmarks, so we built vibe eval π¬
> compare different models with refreshed in-the-wild examples in different categories π€
> submit your favorite model for eval
no numbers -- just vibes!