Model Card for LGVI
Dataset Description
- Paper: https://arxiv.org/abs/2401.10226
- Project Page: https://jianzongwu.github.io/projects/rovi
- Github Repository: https://github.com/jianzongwu/Language-Driven-Video-Inpainting
Model Summary
The LGVI model is trained on ROVI and Inst-Inpaint for the referring inpainting task. Please check our project page for more details.
@article{wu2024lgvi,
title={Towards language-driven video inpainting via multimodal large language models},
author={Wu, Jianzong and Li, Xiangtai and Si, Chenyang and Zhou, Shangchen and Yang, Jingkang and Zhang, Jiangning and Li, Yining and Chen, Kai and Tong, Yunhai and Liu, Ziwei and others},
journal={arXiv preprint arXiv:2401.10226},
year={2024}
}
- Downloads last month
- 41
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.