# UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation

## This repo includes the checkpoints for UniAnimate:

- `models/dw-ll_ucoco_384.onnx`: the checkpoint for DWPose extraction.
- `models/open_clip_pytorch_model.bin`: the checkpoint for the CLIP embedding.
- `models/unianimate_16f_32f_non_ema_223000.pth`: the checkpoint for human image animation in UniAnimate (16/32 frames).
- `models/yolox_l.onnx`: the checkpoint for DWPose extraction.
- `models/v2-1_512-ema-pruned.ckpt`: the checkpoint for Stable Diffusion.

## BibTeX

If this repo is useful to you, please cite our corresponding technical paper.

```bibtex
@article{wang2024unianimate,
  title={UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation},
  author={Wang, Xiang and Zhang, Shiwei and Gao, Changxin and Wang, Jiayu and Zhou, Xiaoqiang and Zhang, Yingya and Yan, Luxin and Sang, Nong},
  journal={arXiv preprint arXiv:2406.01188},
  year={2024}
}

@inproceedings{TFT2V,
  title={A Recipe for Scaling up Text-to-Video Generation with Text-free Videos},
  author={Wang, Xiang and Zhang, Shiwei and Yuan, Hangjie and Qing, Zhiwu and Gong, Biao and Zhang, Yingya and Shen, Yujun and Gao, Changxin and Sang, Nong},
  booktitle={CVPR},
  year={2024}
}

@article{VideoComposer,
  title={VideoComposer: Compositional Video Synthesis with Motion Controllability},
  author={Wang, Xiang and Yuan, Hangjie and Zhang, Shiwei and Chen, Dayou and Wang, Jiuniu and Zhang, Yingya and Shen, Yujun and Zhao, Deli and Zhou, Jingren},
  journal={NeurIPS},
  year={2023}
}
```