--- language: - en --- # Thinking with Generated Images

We introduce **Thinking with Generated Images**, where we enable a single LMM (Large Multimodal Model) to spontaneously generate and reason with intermediate visual thoughts via a native long-multimodal thought process.

thinking-with-generated-images

This model supports vision generation with intermediate visual subgoals.

thinking-with-generated-images

Please refer to [our github repo](https://github.com/GAIR-NLP/thinking-with-generated-images) for more information!