view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 17 days ago • 140
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 55
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 426
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.25k
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 8 days ago • 86
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 8 days ago • 56
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 8 days ago • 148
Turkish Vision-Language Datasets Collection Collection of Turkish vision-language datasets. • 27 items • Updated Mar 31 • 9
view article Article Assisted Generation: a new direction toward low-latency text generation By joaogante • May 11, 2023 • 64
view article Article Llama can now see and run on your device - welcome Llama 3.2 By merve and 6 others • Sep 25, 2024 • 188
view article Article Fine-Tune ViT for Image Classification with 🤗 Transformers By nateraw • Feb 11, 2022 • 49
view article Article CodeGemma - an official Google release for code LLMs By pcuenq and 5 others • Apr 9, 2024 • 101