1. 📘 Topic and Domain: The paper introduces ComfyUI-Copilot, an LLM-powered plugin designed to enhance usability and workflow development in ComfyUI, an open-source platform for AI art creation.
2. 💡 Previous Research and New Ideas: Previous research focused on workflow generation but had limitations like instability and narrow focus on text-to-image tasks; this paper introduces a multi-agent framework with broader capabilities and knowledge bases.
3. ❓ Problem: The paper addresses challenges faced by ComfyUI users, including limited documentation, model misconfigurations, and workflow design complexity.
4. 🛠️ Methods: The paper employs a hierarchical multi-agent framework with a central LLM-based assistant agent and specialized worker agents, supported by extensive knowledge bases covering nodes, models, and workflows.
5. 📊 Results and Evaluation: The system achieved high recall rates (>88.5%) for workflow and node recommendations, with online user feedback showing 85.9% acceptance rate for workflows and 65.4% for nodes, attracting 19K users across 22 countries.