mmaaz60 commited on
Commit
791f726
β€’
1 Parent(s): bb7cdd4

Update README.md

Browse files

Updates model card

Files changed (1) hide show
  1. README.md +27 -0
README.md CHANGED
@@ -1,3 +1,30 @@
1
  ---
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  ---
4
+ # πŸ‘οΈ GLaMM-RegCap-VG
5
+
6
+ ---
7
+ ## πŸ“ Description
8
+ GLaMM-RegCap-VG is the model specific to region-level captioning finetuned on Visual Genome. "RegCap-VG" indicates its specialization in region-level captioning with tuning on the Visual Genome dataset.
9
+
10
+
11
+ ## πŸ’» Download
12
+ To get started with GLaMM-FullScope, follow these steps:
13
+ ```
14
+ git lfs install
15
+ git clone https://huggingface.co/MBZUAI/GLaMM-RegCap-VG
16
+ ```
17
+
18
+ ## πŸ“š Additional Resources
19
+ - **GitHub Repository:** For training and updates: [GitHub - GLaMM](https://github.com/mbzuai-oryx/groundingLMM).
20
+ - **Project Page:** For a detailed overview and insights into the project, visit our [Project Page - GLaMM](https://mbzuai-oryx.github.io/groundingLMM/).
21
+
22
+ ## πŸ“œ Citations and Acknowledgments
23
+
24
+ ```bibtex
25
+ @article{hanoona2023GLaMM,
26
+ title={GLaMM: Pixel Grounding Large Multimodal Model},
27
+ author={Rasheed, Hanoona and Maaz, Muhammad and Shaji, Sahal and Shaker, Abdelrahman and Khan, Salman and Cholakkal, Hisham and Anwer, Rao M. and Xing, Eric and Yang, Ming-Hsuan and Khan, Fahad S.},
28
+ journal={ArXiv 2311.03356},
29
+ year={2023}
30
+ }