20260625:很多新内容
This commit is contained in:
69
papers/yao-ace-router-2026.md
Normal file
69
papers/yao-ace-router-2026.md
Normal file
@@ -0,0 +1,69 @@
|
||||
---
|
||||
title: "ACE-Router:历史感知路由"
|
||||
created: 2026-06-19
|
||||
updated: 2026-06-19
|
||||
type: paper
|
||||
tags: [router-training, history-aware, mcp, tool-selection, agent-web, candidate-graph]
|
||||
sources:
|
||||
- https://arxiv.org/abs/2601.08276
|
||||
- https://github.com/euyis1019/ACE-Router
|
||||
---
|
||||
|
||||
# ACE-Router:历史感知路由
|
||||
|
||||
> **Zhiyuan Yao, Zishan Xu, Yifu Guo 等** · 2026 · arXiv:2601.08276
|
||||
|
||||
## 核心问题
|
||||
|
||||
MCP 工具生态爆炸式增长,现有方案各有限制:
|
||||
- **静态注入**:上下文窗口有限,无法规模化
|
||||
- **Embedding 检索**:静态语义匹配,缺乏多轮历史感知
|
||||
- **通用 LLM**:推理强但缺乏精确工具辨识力
|
||||
|
||||
## 核心方案:训练一个 Router
|
||||
|
||||
ACE-Router 不从零推理——直接**训练一个专门的路由器**,将多轮对话历史对齐到正确的路由决策。
|
||||
|
||||
## 三阶段框架
|
||||
|
||||
### 1. [[candidate-graph|Candidate Graph + 自进化变异]]
|
||||
构建语义相似图 → 五种变异算子扩展候选空间 → 627→2005 工具
|
||||
|
||||
### 2. [[trajectory-synthesis|多 Agent 轨迹合成]]
|
||||
候选图采样 → 四角色模拟(Planner/User/Assistant/Tool Agent)→ 15,092 训练样本。环境无关:LLM 模拟执行,无需真实 API。
|
||||
|
||||
### 3. [[light-routing-agent|Light Routing Agent]]
|
||||
仅两个工具:`router_invoke` + `tool_execute`。路由与执行解耦,可插拔适配工具选择和 Agent 选择。
|
||||
|
||||
## 关键数据
|
||||
|
||||
| 指标 | ACE-Router | Best Baseline |
|
||||
|------|:---:|:---:|
|
||||
| MCP-Universe | **53.44%** | 49.79% (Gemini-2.5-Pro) |
|
||||
| MCP-Mark | **60.00%** | ~50% (ReAct) |
|
||||
| 扩展候选池 | **53.02%** (稳定) | 36.47% (ReAct 崩溃) |
|
||||
| 噪声环境 | **56.00%** | 32% (Gemini-2.5-Pro) |
|
||||
| 多 Agent 泛化 | **88-92%** | — (零训练迁移) |
|
||||
|
||||
**8B 专用路由器 > 巨型通用模型**(GPT-4o, Gemini-2.5-Pro)——证明了精确工具辨识不是靠扩大推理能力,而是靠专门训练。
|
||||
|
||||
## MCP 工具选择三篇之比较
|
||||
|
||||
| | [[fei-mcp-zero-2025|MCP-Zero]] | [[gaurav-dynamic-react-2025|Dynamic ReAct]] | ACE-Router |
|
||||
|---|---|---|---|---|
|
||||
| 机制 | 主动请求 + 层次路由 | meta-tools + 语义搜索 | **训练专用路由器** |
|
||||
| 历史感知 | 迭代请求(隐式) | ReAct 框架内 | **显式训练对齐** |
|
||||
| 规模适应 | 理论 O(m+k) | 工程验证 | **训练+噪声双重验证** |
|
||||
| 泛化 | MCP 工具 | MCP 工具 | **工具→Agent 零训练迁移** |
|
||||
|
||||
## 关键概念
|
||||
|
||||
- [[ace-router|ACE-Router 框架]]
|
||||
- [[history-aware-routing|历史感知路由]]
|
||||
- [[candidate-graph|候选图]]
|
||||
- [[self-evolutionary-mutation|自进化变异]]
|
||||
- [[trajectory-synthesis|轨迹合成]]
|
||||
- [[light-routing-agent|轻量路由 Agent]]
|
||||
- [[agent-web|Agent Web]]
|
||||
|
||||
来源:[原始存档](raw/papers/yao-ace-router-2026.md)
|
||||
Reference in New Issue
Block a user