Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions Paper • 2504.08531 • Published Apr 11