FastVLM Collection Efficient Vision Encoding for Vision Language Models • 9 items • Updated Sep 2 • 102
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • Jun 3 • 261
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 5 items • Updated Sep 1 • 127
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 536
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.3k