jianzongwu
/

lgvi

StableDiffusionPipeline

Inference Endpoints

Model card Files Files and versions Community

Model Card for LGVI

Dataset Description

Paper: https://arxiv.org/abs/2401.10226
Project Page: https://jianzongwu.github.io/projects/rovi
Github Repository: https://github.com/jianzongwu/Language-Driven-Video-Inpainting

Model Summary

The LGVI model is trained on ROVI and Inst-Inpaint for the referring inpainting task. Please check our project page for more details.

@article{wu2024lgvi,
  title={Towards language-driven video inpainting via multimodal large language models},
  author={Wu, Jianzong and Li, Xiangtai and Si, Chenyang and Zhou, Shangchen and Yang, Jingkang and Zhang, Jiangning and Li, Yining and Chen, Kai and Tong, Yunhai and Liu, Ziwei and others},
  journal={arXiv preprint arXiv:2401.10226},
  year={2024}
}

Downloads last month: 41

Inference Examples

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.