Multimodal VLMs - Until July'25 Collection Multimodal VLMs for Domain-Specific Tasks: OCR, Reasoning, and Captioning • 12 items • Updated 25 days ago • 3
prithivMLmods/Qwen2.5-VL-3B-Abliterated-Caption-it Image-Text-to-Text • 4B • Updated 5 days ago • 110 • 3