[ICLR'24] Guiding Instruction-based Image Editing via Multimodal Large Language Models
This repo contains LLaVA-7B and the pre-trained MGIE checkpoint (trained on IPr2Pr + MagicBrush) for use with MGIE.
Please follow the official repo and the ipynb notebook to use them.
@inproceedings{fu2024mgie,
  author = {Tsu-Jui Fu and Wenze Hu and Xianzhi Du and William Yang Wang and Yinfei Yang and Zhe Gan},
  title = {{Guiding Instruction-based Image Editing via Multimodal Large Language Models}},
  booktitle = {International Conference on Learning Representations (ICLR)},
  year = {2024}
}