This checkpoint is copied from https://drive.google.com/drive/folders/1VotCNmdevvtMuJmdxPfg3MOZXJRnV96D This repo is created for convenient huggingface\_hub loading. The original LICENSE is included here. Citation: ``` @article{xvlm, title={Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts}, author={Zeng, Yan and Zhang, Xinsong and Li, Hang}, journal={arXiv preprint arXiv:2111.08276}, year={2021} } ```