---
title: "Strategy Gene (策略基因)"
created: 2026-06-14
updated: 2026-06-14
type: concept
tags: [agent, experience-reuse, test-time-control, representation]
sources: [raw/papers/procedural-skills-to-strategy-genes-2026.md]
---

# Strategy Gene (策略基因)

Proposed by Wang et al. (2026) in [[procedural-skills-to-strategy-genes|Skills to Strategy Genes]]: a **compact, control-oriented representation of experience**, distilled from prior problem-solving experience.

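## Definition

A strategy gene is a control-oriented, reusable representation of experience. It aims not for documentation completeness but for **compactness, structural clarity, behavioral orientation, and failure awareness**.

Formal definition: given a skill package s or a trajectory set H, a gene is obtained through a distillation mapping psi:

```
g = psi(s)  or  g = psi(H),  g in G
```
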
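Gene structure: `g = (m, u, pi, alpha, c, v)`
- **m**: task-matching signal (keywords / trigger cues)
- **u**: compact summary (a one-sentence statement of intent)
- **pi**: strategy steps (including AVOID items)
- **alpha**: failure-aware AVOID cues
- **c**: optional execution constraints
- **v**: optional verification hooks
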
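
The six-part tuple above maps naturally onto a small record type. The following is a minimal sketch, not the representation used in the paper: the class name `StrategyGene` and every field name are hypothetical, chosen only to mirror `m, u, pi, alpha, c, v`, and the verification hook is modeled as plain text because the source does not say what form it takes. The instance is filled in with the UV-Vis example from the operational form on this page.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(frozen=True)
class StrategyGene:
    """Illustrative container for g = (m, u, pi, alpha, c, v).

    All names here are assumptions made for this sketch.
    """
    match_signals: List[str]   # m: keywords / trigger cues
    summary: str               # u: one-sentence statement of intent
    steps: List[str]           # pi: strategy steps
    avoid: List[str]           # alpha: failure-aware AVOID cues
    constraints: List[str] = field(default_factory=list)  # c: optional
    verification: Optional[str] = None                    # v: optional, modeled as text


gene = StrategyGene(
    match_signals=["uv-vis", "peak detection", "FWHM", "unit conversion"],
    summary="Detect peaks and compute wavelength-domain peak properties correctly",
    steps=[
        "Detect peaks with prominence-based criteria",
        "Convert min_distance into sample-index units before peak detection",
    ],
    avoid=[
        "Report FWHM only after converting peak_widths outputs back to wavelength units",
    ],
)
```

The two optional components default to "absent" (`[]` and `None`), which matches their description as optional in the list above.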
## Operational form

```
<strategy-gene>
Domain keywords: uv-vis, peak detection, FWHM, unit conversion
Summary: Detect peaks and compute wavelength-domain peak properties correctly
Strategy:
1. Detect peaks with prominence-based criteria
2. Convert min_distance into sample-index units before peak detection
3. AVOID: Report FWHM only after converting peak_widths outputs back to wavelength units
</strategy-gene>
```

A gene in this form typically runs to about 230 tokens.

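
To make the layout of the operational form concrete, here is a hedged sketch of a serializer that produces the same text from separate fields. The function name `render_gene` and its parameters are hypothetical, and numbering the AVOID cues after the ordinary steps is an assumption read off the single example above, not a rule stated in the source.

```python
from typing import List


def render_gene(keywords: List[str], summary: str,
                steps: List[str], avoid: List[str]) -> str:
    """Serialize gene fields into the <strategy-gene> text form (illustrative only)."""
    lines = [
        "<strategy-gene>",
        "Domain keywords: " + ", ".join(keywords),
        "Summary: " + summary,
        "Strategy:",
    ]
    # Assumption: ordinary steps come first, AVOID cues follow in the same numbered list.
    numbered = list(steps) + ["AVOID: " + cue for cue in avoid]
    for i, item in enumerate(numbered, start=1):
        lines.append(f"{i}. {item}")
    lines.append("</strategy-gene>")
    return "\n".join(lines)


text = render_gene(
    keywords=["uv-vis", "peak detection", "FWHM", "unit conversion"],
    summary="Detect peaks and compute wavelength-domain peak properties correctly",
    steps=[
        "Detect peaks with prominence-based criteria",
        "Convert min_distance into sample-index units before peak detection",
    ],
    avoid=["Report FWHM only after converting peak_widths outputs back to wavelength units"],
)
print(text)
```

Called with the fields of the UV-Vis example, this reproduces the eight lines of the block above.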
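## Differences from procedural skills

| Dimension | Strategy gene (Gene) | Procedural skill (Skill) |
|-----------|----------------------|--------------------------|
| Goal | Control-oriented | Documentation-oriented |
| Size | ~230 tokens | ~2,500 tokens |
| Organizing logic | Control logic | Documentation logic |
| Priorities | Signal density, scope of applicability, failure salience | Readability, completeness |

A strategy gene is **not a shortened skill**; it is a **different abstraction** of reusable experience.
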
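## Key findings

- Overall, Gene is +3.0 pp (percentage points) versus the baseline; Skill is -1.1 pp
- Gene is sensitive to content perturbation (a wrong algorithm gives -5.2 pp) but robust to structural perturbation (reversed order still retains +2.8 pp)
- Adding documentation material back in (API notes, examples) usually weakens a Gene rather than strengthening it
- A single targeted Gene outperforms a combination of multiple Genes
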
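
The last finding suggests picking one gene per task rather than stacking several. The sketch below illustrates that idea under loud assumptions: the function `select_single_gene`, the keyword-substring scoring, and the `csv-cleaning` gene are all hypothetical and invented for illustration; the source does not describe how the matching signal m is actually evaluated.

```python
from typing import Dict, List, Optional


def select_single_gene(task: str, genes: Dict[str, List[str]]) -> Optional[str]:
    """Return the name of the one gene whose keywords best match the task, or None.

    Scoring by case-insensitive substring hits is an assumption of this sketch,
    not the matching procedure from the paper.
    """
    task_lower = task.lower()
    best_name: Optional[str] = None
    best_score = 0
    for name, keywords in genes.items():
        score = sum(1 for kw in keywords if kw.lower() in task_lower)
        if score > best_score:  # strict '>' keeps the earliest gene on ties
            best_name, best_score = name, score
    return best_name


genes = {
    "uvvis-peaks": ["uv-vis", "peak detection", "FWHM", "unit conversion"],
    "csv-cleaning": ["csv", "missing values", "dtype"],  # hypothetical second gene
}
chosen = select_single_gene(
    "Run peak detection on a UV-Vis spectrum and report FWHM", genes
)
```

Returning `None` when nothing matches leaves the agent without a gene instead of forcing a poorly matched one into the context.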
## References

- [[procedural-skills-to-strategy-genes|Skills to Strategy Genes]]: the paper that proposes the concept
- [[gene-evolution-protocol|GEP protocol]]: the protocol layer for Genes
- [[procedural-skill|Procedural skill]]: the point of comparison
- [[experience-distillation|Experience distillation]]: the process by which Genes are extracted