20260601
This commit is contained in:
48
concepts/skill-as-external-state.md
Normal file
48
concepts/skill-as-external-state.md
Normal file
@@ -0,0 +1,48 @@
|
||||
---
|
||||
title: "Skill as External State (Skill 作为外部状态)"
|
||||
created: 2026-05-29
|
||||
updated: 2026-05-29
|
||||
type: concept
|
||||
tags: ["agent", "skill", "adaptation", "philosophy"]
|
||||
sources: ["https://arxiv.org/abs/2605.23904"]
|
||||
---
|
||||
|
||||
# Skill as External State (Skill 作为外部状态)
|
||||
|
||||
**Skill as External State** 是 [[skillopt|SkillOpt]] 的核心哲学:Agent 的适应不一定要改模型权重——**skill 文档本身就是一个可训练的"外部状态"**。
|
||||
|
||||
## 哲学转变
|
||||
|
||||
| 传统观点 | SkillOpt 观点 |
|
||||
|----------|---------------|
|
||||
| Adaptation = 改模型权重 | Adaptation = 优化外部文本状态 |
|
||||
| Skill 是静态配置 | Skill 是可训练的 artifact |
|
||||
| 优化只能用梯度 | 优化可以在文本空间中进行 |
|
||||
|
||||
## 为什么 text-as-state 可行
|
||||
|
||||
1. **Portable**:skill 是 Markdown 文件,可以跨模型/跨 harness 迁移
|
||||
2. **Inspectable**:人类可阅读、可审计
|
||||
3. **Compact**:300–2,000 tokens,不影响推理效率
|
||||
4. **Trainable**:可以用 [[text-space-optimizer|text-space optimizer]] 系统优化
|
||||
5. **Deployable**:训练后的 `best_skill.md` 可直接部署,无需模型权重更新
|
||||
|
||||
## 与 Model-Harness 关系
|
||||
|
||||
Skill 作为外部状态,位于 Model 和 Harness 之间:
|
||||
|
||||
```
|
||||
Model (frozen) ← Skill (trainable) ← Harness (execution)
|
||||
```
|
||||
|
||||
这呼应了 [[model-harness-relationship|Model-Harness 关系演进]] 中的核心理念——适应层从"模型权重"转移到"外部结构"。
|
||||
|
||||
## 与 Heuristic Learning 的联系
|
||||
|
||||
[[heuristic-learning|Heuristic Learning]] 将优化主体从 Model 参数扩展到 Agent 整体。SkillOpt 更进一步将 Agent 整体分解为 frozen model + trainable skill,使优化目标更精确、更可控。
|
||||
|
||||
## 相关
|
||||
|
||||
- [[skillopt]] — 将这一哲学工程化的方法
|
||||
- [[text-space-optimizer]] — 文本空间优化的范式
|
||||
- [[model-harness-relationship]] — Model-Harness-Skill 的三元关系
|
||||
Reference in New Issue
Block a user