Update README.md
Browse files
README.md
CHANGED
@@ -28,4 +28,25 @@ UGround is a storng GUI visual grounding model trained with a simple recipe. Che
|
|
28 |
- [ ] Data Construction Scripts
|
29 |
- [ ] Guidance of Open-source Data
|
30 |
- [ ] Full Data
|
31 |
-
- [x] Online Demo (HF Spaces)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
28 |
- [ ] Data Construction Scripts
|
29 |
- [ ] Guidance of Open-source Data
|
30 |
- [ ] Full Data
|
31 |
+
- [x] Online Demo (HF Spaces)
|
32 |
+
|
33 |
+
## Citation Information
|
34 |
+
|
35 |
+
If you find this work useful, please consider citing our paper:
|
36 |
+
|
37 |
+
```
|
38 |
+
@article{gou2024uground,
|
39 |
+
title={Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents},
|
40 |
+
author={Boyu Gou and Ruohan Wang and Boyuan Zheng and Yanan Xie and Cheng Chang and Yiheng Shu and Huan Sun and Yu Su},
|
41 |
+
journal={arXiv preprint arXiv:2410.05243},
|
42 |
+
year={2024},
|
43 |
+
url={https://arxiv.org/abs/2410.05243},
|
44 |
+
}
|
45 |
+
|
46 |
+
@article{zheng2023seeact,
|
47 |
+
title={GPT-4V(ision) is a Generalist Web Agent, if Grounded},
|
48 |
+
author={Boyuan Zheng and Boyu Gou and Jihyung Kil and Huan Sun and Yu Su},
|
49 |
+
journal={arXiv preprint arXiv:2401.01614},
|
50 |
+
year={2024},
|
51 |
+
}
|
52 |
+
```
|