Article Featherless AI on Hugging Face Inference Providers 🔥 By sbrandeis and 5 others • Jun 12 • 45
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 487
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 2 days ago • 52
Article State of open video generation models in Diffusers By sayakpaul and 2 others • Jan 27 • 56
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22 • 596k • 1.02k
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 447
Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others • Feb 21 • 174