UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics โข 374 items โข Updated about 9 hours ago โข 54
Dataset Creation Collection Spaces and utilities for creating datasets and getting them on the Hub โข 3 items โข Updated Nov 10, 2024 โข 10
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 15 items โข Updated Dec 6, 2024 โข 611
Molmo Collection Artifacts for open multimodal language models. โข 5 items โข Updated Apr 30 โข 305
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper โข 2409.08264 โข Published Sep 12, 2024 โข 49
LLaVA-Video Collection Models focus on video understanding (previously known as LLaVA-NeXT-Video). โข 8 items โข Updated Feb 21 โข 61
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma โข 16 items โข Updated 12 days ago โข 147
Vision Language Models Papers ๐ผ๏ธ๐ฌ๐ Collection Papers about vision-language models, most important ones are on top of the list. โข 27 items โข Updated Apr 30, 2024 โข 36