MambaVision: A Hybrid Mamba-Transformer Vision Backbone Paper • 2407.08083 • Published Jul 10, 2024 • 33
Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models Paper • 2503.11224 • Published Mar 14 • 26
Article Welcome PaliGemma 2 – New vision language models by Google Dec 5, 2024 • 152
IDEA-Research/grounding-dino-base Zero-Shot Object Detection • Updated May 12, 2024 • 1.12M • 86
google/siglip-base-patch16-224 Zero-Shot Image Classification • Updated Sep 26, 2024 • 231k • 43
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 130
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15, 2024 • 177