osunlp/UGround

BoyuNLP committed 29 days ago

Commit 132ea2c • 1 Parent(s): 1380629

Update README.md

Files changed (1)

README.md

@@ -28,4 +28,25 @@ UGround is a storng GUI visual grounding model trained with a simple recipe. Che
   - [ ] Data Construction Scripts
   - [ ] Guidance of Open-source Data
   - [ ] Full Data
-- [x] Online Demo (HF Spaces)

+- [x] Online Demo (HF Spaces)
+## Citation Information
+If you find this work useful, please consider citing our paper:
+```
+@article{gou2024uground,
+        title={Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents},
+        author={Boyu Gou and Ruohan Wang and Boyuan Zheng and Yanan Xie and Cheng Chang and Yiheng Shu and Huan Sun and Yu Su},
+        journal={arXiv preprint arXiv:2410.05243},
+        year={2024},
+        url={https://arxiv.org/abs/2410.05243},
+      }
+@article{zheng2023seeact,
+        title={GPT-4V(ision) is a Generalist Web Agent, if Grounded},
+        author={Boyuan Zheng and Boyu Gou and Jihyung Kil and Huan Sun and Yu Su},
+        journal={arXiv preprint arXiv:2401.01614},
+        year={2024},
+      }
+```