Files
myWiki/concepts/trajectory-synthesis.md

43 lines
1.3 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "轨迹合成 — Trajectory Synthesis"
created: 2026-06-19
updated: 2026-06-19
type: concept
tags: [trajectory-synthesis, training-data, multi-agent-simulation, ace-router]
sources:
- https://arxiv.org/abs/2601.08276
---
# 轨迹合成Trajectory Synthesis
## 定义
Trajectory Synthesis 是 ACE-Router 的第二阶段:通过**多 Agent 角色模拟**从候选图生成多轮交互轨迹,从中提取历史感知路由的监督信号。
## 四角色模拟
```
Planner Agent → 基于采样子集设计对话蓝图和粗略执行计划
User Agent → 按蓝图生成自然用户请求和跟进问题
Assistant Agent → 执行推理,选择工具并调用
Tool Agent → 模拟工具返回结果(无真实 API
```
## 设计优势
- **环境无关**LLM 模拟执行结果,无需访问真实 API——可大规模扩展
- **历史感知**:从轨迹中提取 (历史→路由决策) 对,而非静态 (查询→工具) 对
- **结构化监督**:标记每步实际调用的候选作为 ground-truth label
## 产出
- 627 初始工具 → 2005 扩展工具(来自候选图)
- 15,092 个训练样本(供 Qwen3-8B LoRA 微调)
## 参考
- [[candidate-graph|候选图]]
- [[ace-router|ACE-Router]]
- [[history-aware-routing|历史感知路由]]
- [[yao-ace-router-2026|论文]]