20260625:很多新内容
This commit is contained in:
42
concepts/trajectory-synthesis.md
Normal file
42
concepts/trajectory-synthesis.md
Normal file
@@ -0,0 +1,42 @@
|
||||
---
|
||||
title: "轨迹合成 — Trajectory Synthesis"
|
||||
created: 2026-06-19
|
||||
updated: 2026-06-19
|
||||
type: concept
|
||||
tags: [trajectory-synthesis, training-data, multi-agent-simulation, ace-router]
|
||||
sources:
|
||||
- https://arxiv.org/abs/2601.08276
|
||||
---
|
||||
|
||||
# 轨迹合成(Trajectory Synthesis)
|
||||
|
||||
## 定义
|
||||
|
||||
Trajectory Synthesis 是 ACE-Router 的第二阶段:通过**多 Agent 角色模拟**从候选图生成多轮交互轨迹,从中提取历史感知路由的监督信号。
|
||||
|
||||
## 四角色模拟
|
||||
|
||||
```
|
||||
Planner Agent → 基于采样子集设计对话蓝图和粗略执行计划
|
||||
User Agent → 按蓝图生成自然用户请求和跟进问题
|
||||
Assistant Agent → 执行推理,选择工具并调用
|
||||
Tool Agent → 模拟工具返回结果(无真实 API)
|
||||
```
|
||||
|
||||
## 设计优势
|
||||
|
||||
- **环境无关**:LLM 模拟执行结果,无需访问真实 API——可大规模扩展
|
||||
- **历史感知**:从轨迹中提取 (历史→路由决策) 对,而非静态 (查询→工具) 对
|
||||
- **结构化监督**:标记每步实际调用的候选作为 ground-truth label
|
||||
|
||||
## 产出
|
||||
|
||||
- 627 初始工具 → 2005 扩展工具(来自候选图)
|
||||
- 15,092 个训练样本(供 Qwen3-8B LoRA 微调)
|
||||
|
||||
## 参考
|
||||
|
||||
- [[candidate-graph|候选图]]
|
||||
- [[ace-router|ACE-Router]]
|
||||
- [[history-aware-routing|历史感知路由]]
|
||||
- [[yao-ace-router-2026|论文]]
|
||||
Reference in New Issue
Block a user