Running 317 317 Qwen2.5 Omni 7B Demo ๐ Generate text and speech responses from text, images, or audio input
Running on Zero 70 70 VLM R1 Referral Expression ๐ฌ Mark regions in images based on text descriptions
Alibaba-NLP/gme-Qwen2-VL-2B-Instruct Sentence Similarity โข Updated about 16 hours ago โข 12.6k โข 79