GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI
Abstract
Much previous AI research has focused on developing monolithic models to maximize their intelligence and capability, with the primary goal of enhancing performance on specific tasks. In contrast, this paper explores an alternative approach: collaborative AI systems that use workflows to integrate models, data sources, and pipelines to solve complex and diverse tasks. We introduce GenAgent, an LLM-based framework that automatically generates complex workflows, offering greater flexibility and scalability compared to monolithic models. The core innovation of GenAgent lies in representing workflows with code, alongside constructing workflows with collaborative agents in a step-by-step manner. We implement GenAgent on the ComfyUI platform and propose a new benchmark, OpenComfy. The results demonstrate that GenAgent outperforms baseline approaches in both run-level and task-level evaluations, showing its capability to generate complex workflows with superior effectiveness and stability.
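To make the idea of "representing workflows with code" concrete, here is a minimal sketch in Python. It is not GenAgent's actual representation, which this page does not show. It assumes ComfyUI's API-style prompt format, where each node is keyed by an id and stores a `class_type` plus `inputs`, and a link to another node is written as `[node_id, output_slot]`. The `Workflow` class, the `out` helper, and `build_text_to_image` are hypothetical names introduced for illustration. The checkpoint file name is a placeholder, and the node names and inputs follow stock ComfyUI nodes as commonly documented.

```python
import json
from itertools import count


class Workflow:
    """Hypothetical helper: collects nodes and emits an API-style prompt dict."""

    def __init__(self):
        self._ids = count(1)
        self.nodes = {}

    def add(self, class_type, **inputs):
        """Register a node and return its id so later nodes can link to it."""
        node_id = str(next(self._ids))
        self.nodes[node_id] = {"class_type": class_type, "inputs": inputs}
        return node_id

    def to_json(self):
        """Serialize the collected nodes as JSON text."""
        return json.dumps(self.nodes, indent=2)


def out(node_id, slot=0):
    """Reference output `slot` of another node, written as [node_id, slot]."""
    return [node_id, slot]


def build_text_to_image(prompt, negative=""):
    """Assemble a basic text-to-image graph one node at a time."""
    wf = Workflow()
    # Assumed output order of the loader: 0 = model, 1 = CLIP, 2 = VAE.
    ckpt = wf.add("CheckpointLoaderSimple", ckpt_name="model.safetensors")  # placeholder name
    pos = wf.add("CLIPTextEncode", text=prompt, clip=out(ckpt, 1))
    neg = wf.add("CLIPTextEncode", text=negative, clip=out(ckpt, 1))
    latent = wf.add("EmptyLatentImage", width=512, height=512, batch_size=1)
    sampled = wf.add(
        "KSampler",
        model=out(ckpt, 0), positive=out(pos), negative=out(neg),
        latent_image=out(latent), seed=0, steps=20, cfg=7.0,
        sampler_name="euler", scheduler="normal", denoise=1.0,
    )
    image = wf.add("VAEDecode", samples=out(sampled), vae=out(ckpt, 2))
    wf.add("SaveImage", images=out(image), filename_prefix="genagent_demo")
    return wf


if __name__ == "__main__":
    print(build_text_to_image("a watercolor fox in a forest").to_json())
```

Writing the graph as ordinary function calls means each step names the node it adds and the earlier outputs it consumes. That is the kind of incremental, step-by-step construction the abstract describes, and the result still serializes to the JSON form a ComfyUI server consumes.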
Community
We’re excited to share our latest work, GenAgent! This system leverages AI agents to create workflows automatically.
In particular, we use agents to generate ComfyUI workflows, allowing users to build complex generation pipelines using just natural language.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems (2024)
- Collaborative Evolving Strategy for Automatic Data-Centric Development (2024)
- Re-Thinking Process Mining in the AI-Based Agents Era (2024)
- WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks (2024)
- Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? (2024)
Great work! This is what I want to do. I have always thought that LLMs can understand JSON-formatted workflows and perhaps create new workflows.