SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 1 day ago • 82
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 73
openai/whisper-large-v3-turbo Automatic Speech Recognition • Updated Oct 4, 2024 • 10M • 2.01k
Running on T4 • 1.05k • Open NotebookLM • Personalised Podcasts For All - Available in 13 Languages