The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • 2.67k
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 484
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated Apr 30 • 72
Article wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett • Sep 27, 2024 • 46
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 11 days ago • 147