1. 📘 Topic and Domain: The paper focuses on skill-aware orchestration for routing agents in compound AI systems, specifically addressing multi-turn model routing and agent orchestration in the domain of large language models (LLMs).
2. 💡 Previous Research and New Ideas: The paper builds on existing model routing approaches (heuristic, discriminative, and RL-based methods like Router-R1 and ToolOrchestra) but proposes SkillOrchestra, which learns fine-grained skills from execution experience and models agent-specific competence rather than directly learning routing policies end-to-end.
3. ❓ Problem: The paper addresses limitations in current routing approaches: input-level routers make coarse query-level decisions ignoring evolving task requirements, and RL-trained orchestrators are expensive to adapt and suffer from routing collapse (repeatedly invoking one strong but costly option).
4. 🛠️ Methods: SkillOrchestra learns a reusable Skill Handbook from execution traces containing mode-level insights, fine-grained skills, and agent profiles, then performs skill-grounded routing by selecting agents based on required skills and explicit performance-cost trade-offs at deployment time.
5. 📊 Results and Evaluation: SkillOrchestra outperforms state-of-the-art RL-based orchestrators by up to 22.5% with 700× and 300× learning cost reduction compared to Router-R1 and ToolOrchestra respectively, achieving higher accuracy at lower cost across ten benchmarks while demonstrating better routing balance and transferability across orchestrator models.