ruohongz commited on
Commit
83dc690
·
verified ·
1 Parent(s): f676b8b

Upload /README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +48 -3
README.md CHANGED
@@ -1,3 +1,48 @@
1
- This is the same model as in
2
- [Lin-Chen/open-llava-next-llama3-8b](https://huggingface.co/Lin-Chen/open-llava-next-llama3-8b).
3
- We back up that repo just for replicable experiments of our project.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ inference: false
4
+ datasets:
5
+ - Lin-Chen/ShareGPT4V
6
+ pipeline_tag: image-text-to-text
7
+ ---
8
+ <br>
9
+ <br>
10
+
11
+ # open-llava-next-llama3-8b Model Card
12
+
13
+ ## Model details
14
+
15
+ **Model type:**
16
+ open-llava-next-llama3-8b is an open-source implementation chatbot trained by fine-tuning the entire model on open-source [Open-LLaVA-Next-mix1M](https://huggingface.co/datasets/Lin-Chen/Open-LLaVA-NeXT-mix1M) data.
17
+
18
+ **Model date:**
19
+ open-llava-next-llama3-8b was trained in May 2024.
20
+
21
+ **Paper or resources for more information:**
22
+ [[Code](https://github.com/xiaoachen98/Open-LLaVA-NeXT)]
23
+
24
+ | Name | ViT | LLM | Weights | MME | SEED | SQA | MMB | MMB-CN | TextVQA | GQA |
25
+ |---|---|---|---|---|---|---|---|---|---|---|
26
+ | llava-next-vicuna-7b | CLIP-L-336 | Vicuna-7B | [HF](https://huggingface.co/liuhaotian/llava-v1.6-vicuna-7b) | 1519 | 70.2 | 70.1 | 67.4 | 60.6 | 64.9 | 64.2 |
27
+ | open-llava-next-vicuna-7b| CLIP-L-336 | Vicuna-7B | [HF](https://huggingface.co/Lin-Chen/open-llava-next-vicuna-7b) | 1540 | 71.1 | 70.7 | 68.5 | 60.7 | 67.2 | 64.3 |
28
+ | open-llava-next-llama3-8b| CLIP-L-336 | LLaMA3-8B | [HF](https://huggingface.co/Lin-Chen/open-llava-next-llama3-8b) | 1552 | 74.4 | 77.3 | 74.4 | 70.4 | 69.8 | 65.9 |
29
+
30
+ ## Usage
31
+
32
+ You can utilize this model as we provide in our [[repository](https://github.com/xiaoachen98/Open-LLaVA-NeXT)]. Moreover, you can direct load this model and use it in the [[LLaVA repository](https://github.com/haotian-liu/LLaVA)].
33
+
34
+ ## Training dataset
35
+
36
+ All training data are open-sourced in our [repository](https://github.com/xiaoachen98/Open-LLaVA-NeXT).
37
+
38
+ - [ShareGPT4V Mix665K](https://huggingface.co/datasets/Lin-Chen/ShareGPT4V)
39
+ - 200K GPT4V-generated instruction data (ALLaVA)
40
+ - 200K various VQA data
41
+
42
+ ## Intended use
43
+
44
+ **Primary intended uses:**
45
+ The primary use of open-llava-next-llama3-8b is research on large multimodal models and chatbots.
46
+
47
+ **Primary intended users:**
48
+ The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.