Model Card

Bunny is a family of lightweight multimodal models.

Bunny-qwen1.5-1.8b-siglip-lora leverages Qwen1.5-1.8B as the language model backbone and SigLIP as the vision encoder. It is pretrained on LAION-2M and finetuned on Bunny-695K.
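The model name appears to encode the components listed above. As a rough illustration only, the sketch below splits a name of this shape into its parts. The five-field layout (family, language model, size, vision encoder, adapter) and the names `BunnyVariant` and `parse_variant` are assumptions made for this example, not a documented Bunny naming convention or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BunnyVariant:
    """Hypothetical container for the parts of a Bunny model name."""
    family: str
    language_model: str
    size: str
    vision_encoder: str
    adapter: str


def parse_variant(name: str) -> BunnyVariant:
    # Assumption: names follow "<family>-<llm>-<size>-<vision>-<adapter>".
    parts = name.split("-")
    if len(parts) != 5:
        raise ValueError(f"expected 5 hyphen-separated fields, got {len(parts)}")
    return BunnyVariant(*parts)


variant = parse_variant("Bunny-qwen1.5-1.8b-siglip-lora")
print(variant.language_model, variant.size, variant.vision_encoder)
```

Names that do not have exactly five hyphen-separated fields raise a `ValueError`, so this sketch would need adjusting for other checkpoints.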

More details about this model can be found on GitHub.

License

This project uses datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of those licenses. The content of this project itself is licensed under the Apache License 2.0.