---
language:
- en
---
# Thinking with Generated Images
We introduce **Thinking with Generated Images**, where we enable a single LMM (Large Multimodal Model) to spontaneously generate and reason with intermediate visual thoughts via a native long-multimodal thought process.
This model supports vision generation with intermediate visual subgoals.
Please refer to [our github repo](https://github.com/GAIR-NLP/thinking-with-generated-images) for more information!