microsoft/Phi-3.5-vision-instruct Image-Text-to-Text β’ 4B β’ Updated Sep 26, 2024 β’ 1.07M β’ 690
Running 550 550 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects
internlm/internlm-xcomposer2d5-7b Visual Question Answering β’ Updated Jul 22, 2024 β’ 2.75k β’ 206
DAMO-NLP-SG/VideoLLaMA2-7B Visual Question Answering β’ 8B β’ Updated Aug 13, 2024 β’ 2.79k β’ 41
TinyLlama/TinyLlama-1.1B-Chat-v1.0 Text Generation β’ 1B β’ Updated Mar 17, 2024 β’ 1.08M β’ 1.31k