MolmoAct: Action Reasoning Models that can Reason in Space • Paper • 2508.07917 • Published 11 days ago • 38
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs • Paper • 2401.11708 • Published Jan 22, 2024 • 31