ShowUI: One Vision-Language-Action Model for GUI Visual Agent • Paper • 2411.17465 • Published 7 days ago • 64
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use • Paper • 2411.10323 • Published 18 days ago • 28
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 15 items • Updated 6 days ago • 162