view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Mar 4 โข 74
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 โข 10 items โข Updated Apr 3 โข 55
LLaVA-ฯ: Efficient Multi-Modal Assistant with Small Language Model Paper โข 2401.02330 โข Published Jan 4, 2024 โข 17