1. 📘 Topic and Domain: The paper introduces DeepAgent, an end-to-end deep reasoning agent that can autonomously use various tools and interact with environments, falling within the domain of AI agents and large language models.
2. 💡 Previous Research and New Ideas: Based on previous work in LLM-powered agents like ReAct and Plan-and-Solve, it proposes a novel unified reasoning process that integrates tool discovery and execution, moving away from predefined workflows.
3. ❓ Problem: The paper addresses the limitations of existing agent frameworks that rely on predefined workflows and limited tool sets, which constrains their ability to handle real-world tasks requiring flexible tool use and long-horizon interactions.
4. 🛠️ Methods: The paper implements autonomous memory folding to compress interaction history, uses brain-inspired memory architecture (episodic, working, and tool memories), and develops ToolPO - an end-to-end reinforcement learning strategy with LLM-simulated APIs.
5. 📊 Results and Evaluation: DeepAgent consistently outperformed baseline methods across eight benchmarks, including general tool-use tasks (ToolBench, API-Bank, TMDB, Spotify, ToolHop) and downstream applications (ALFWorld, WebShop, GAIA, HLE), demonstrating superior performance in both labeled-tool and open-set tool retrieval scenarios.