VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Paper • 2504.07615 • Published 11 days ago • 29
Running 2.49k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Collaborative Instance Navigation: Leveraging Agent Self-Dialogue to Minimize User Input Paper • 2412.01250 • Published Dec 2, 2024 • 5
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Mar 13 • 302