UI Agent Collection • a collection of algorithmic agents for user interfaces/interactions and program synthesis • 191 items • Updated about 5 hours ago • 24
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning Paper • 2409.20566 • Published Sep 30 • 52
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents Paper • 2408.06327 • Published Aug 12 • 15