File size: 1,149 Bytes
600b4cf d93da3f 600b4cf c353a30 00cb600 c353a30 7da3813 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
---
license: other
license_name: ml-mgie
license_link: https://github.com/apple/ml-mgie/blob/main/LICENSE.txt
---
# [ICLR'24] Guiding Instruction-based Image Editing via Multimodal Large Language Models
This repo contains [LLaVA-7B](https://huggingface.co/liuhaotian/LLaVA-Lightning-7B-delta-v1-1) and [pre-trained MGIE ckpt](https://docs-assets.developer.apple.com/ml-research/models/mgie/mgie_7b.tar.gz) (on IPr2Pr + MagicBrush) for [MGIE](https://huggingface.co/spaces/tsujuifu/ml-mgie)
<img src="https://raw.githubusercontent.com/apple/ml-mgie/main/mgie.png" width="60%" />
Please follow the [offical repo](https://github.com/apple/ml-mgie) and [ipynb](https://github.com/apple/ml-mgie/blob/main/demo.ipynb) to use it
<img src="https://raw.githubusercontent.com/apple/ml-mgie/main/demo.png" width="60%" />
```
@inproceedings{fu2024mgie,
author = {Tsu-Jui Fu and Wenze Hu and Xianzhi Du and William Yang Wang and Yinfei Yang, and Zhe Gan},
title = {{Guiding Instruction-based Image Editing via Multimodal Large Language Models}},
booktitle = {International Conference on Learning Representations (ICLR)},
year = {2024}
}
```
|