MobA: A Two-Level Agent System for Efficient Mobile Task Automation Paper • 2410.13757 • Published 21 days ago • 30
Agent S: An Open Agentic Framework that Uses Computers Like a Human Paper • 2410.08164 • Published 28 days ago • 24
WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration Paper • 2408.15978 • Published Aug 28
Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents Paper • 2409.17140 • Published Sep 25
AssistantX: An LLM-Powered Proactive Assistant in Collaborative Human-Populated Environment Paper • 2409.17655 • Published Sep 26
Harnessing Webpage UIs for Text-Rich Visual Understanding Paper • 2410.13824 • Published 21 days ago • 29
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published 21 days ago • 40
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published 8 days ago • 43