neulab
/

Pangea-7B-hf

Safetensors

llava_next

Model card Files Files and versions Community

seungone commited on Oct 28

Commit

e9db639

•

1 Parent(s): 4618cf2

Update README.md

Browse files

Files changed (1) hide show

README.md +91 -3

README.md CHANGED Viewed

@@ -1,5 +1,79 @@
-The following is the code to run Pangea-7B using huggingface generate:
-```
 # Assuming that you have text_input and image_path
 from transformers import LlavaNextForConditionalGeneration, AutoProcessor
 import torch
@@ -21,4 +95,18 @@ output = output[0]
 result = processor.decode(output, skip_special_tokens=True, clean_up_tokenization_spaces=False)
 print(result)
-```

+---
+license: apache-2.0
+datasets:
+- neulab/PangeaInstruct
+language:
+- am
+- ar
+- bg
+- bn
+- cs
+- de
+- el
+- en
+- es
+- fa
+- fr
+- ga
+- hi
+- id
+- ig
+- it
+- iw
+- ja
+- jv
+- ko
+- nl
+- mn
+- ms
+- no
+- pl
+- pt
+- ro
+- ru
+- si
+- su
+- sw
+- ta
+- te
+- th
+- tr
+- uk
+- ur
+- vi
+- zh
+base_model:
+- Qwen/Qwen2-7B-Instruct
+---
+# Pangea-7B Model Card
+[Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages](https://neulab.github.io/Pangea/)
+🇪🇹 🇸🇦 🇧🇬 🇧🇩 🇨🇿 🇩🇪 🇬🇷 🇬🇧 🇺🇸 🇪🇸 🇮🇷 🇫🇷 🇮🇪 🇮🇳 🇮🇩 🇳🇬 🇮🇹 🇮🇱 🇯🇵 🇮🇩 🇰🇷 🇳🇱 🇲🇳 🇲🇾 🇳🇴 🇵🇱 🇵🇹 🇧🇷 🇷🇴 🇷🇺 🇱🇰 🇮🇩 🇰🇪 🇹🇿 🇱🇰 🇹🇭 🇹🇷 🇺🇦 🇵🇰 🇻🇳 🇨🇳 🇹🇼
+[🏠 Homepage](https://neulab.github.io/Pangea/) | [🤖 Pangea-7B](https://huggingface.co/neulab/Pangea-7B) | [📊 PangeaIns](https://huggingface.co/datasets/neulab/PangeaInstruct) | [🧪 PangeaBench](https://huggingface.co/collections/neulab/pangea-6713c3b0d78a453906eb2ed8) | [💻 Github](https://github.com/neulab/Pangea/tree/main) | [📄 Arxiv](https://arxiv.org/abs/2410.16153) | [📕 PDF](https://arxiv.org/pdf/2410.16153) | [🖥️ Demo](https://huggingface.co/spaces/neulab/Pangea)
+<img src="https://cdn-uploads.huggingface.co/production/uploads/6230d750d93e84e233882dbc/ZjVTKnIsyshWpo-PWg9gM.png" alt="description" style="width:300px;">
+## Model details
+ - **Model:** Pangea is a fully open-source Multilingual Multimodal Multicultural LLM.
+ - **Date:** Pangea-7B was trained in 2024.
+ - **Training Dataset:** [6M PangeaIns](https://huggingface.co/datasets/neulab/PangeaInstruct).
+ - **Architecture:** Pangea-7B follows the architecture of [LLaVA-NeXT](https://github.com/LLaVA-VL/LLaVA-NeXT), with a [Qwen2-7B-Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct) backbone.
+## Uses
+Pangea-7B follows the architecture of [LLaVA-NeXT](https://github.com/LLaVA-VL/LLaVA-NeXT).
+You could either (1) follow the same model loading procedures as of [LLaVA-NeXT](https://github.com/LLaVA-VL/LLaVA-NeXT), an example of loading Pangea-7B directly is shown in the Python code below, or (2) use our hf version of Pangea-7B: [Pangea-7B-hf]https://huggingface.co/neulab/Pangea-7B-hf
+### Direct Use
+The hf version is intended so that you could use Pangea-7B with the huggingface generate function.
+If you want to use it with the Llava-Next codebase, please refer to our [original checkpoint](https://huggingface.co/neulab/Pangea-7B).
+```python
 # Assuming that you have text_input and image_path
 from transformers import LlavaNextForConditionalGeneration, AutoProcessor
 import torch
 result = processor.decode(output, skip_special_tokens=True, clean_up_tokenization_spaces=False)
 print(result)
+```
+## Citing the Model
+**BibTeX Citation:**
+```
+@article{yue2024pangeafullyopenmultilingual,
+  title={Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages},
+  author={Xiang Yue and Yueqi Song and Akari Asai and Seungone Kim and Jean de Dieu Nyandwi and Simran Khanuja and Anjali Kantharuban and Lintang Sutawika and Sathyanarayanan Ramamoorthy and Graham Neubig},
+  year={2024},
+  journal={arXiv preprint arXiv:2410.16153},
+  url={https://arxiv.org/abs/2410.16153}
+}
+```