20260617: currently 914 pages
43
concepts/procedural-skill-layer.md
Normal file
@@ -0,0 +1,43 @@
---
title: "Procedural Skill Layer"
created: 2026-06-11
updated: 2026-06-11
type: concept
tags: [agent, harness, skill, retrieval]
sources: [raw/papers/xu-life-harness-runtime-adaptation-2026.md]
---

# Procedural Skill Layer
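
> Layer ② of Life-Harness: it builds a skill memory bank from training trajectories and, during the task-conditioning stage, retrieves relevant procedural skills to guide decisions.

## Mechanism

The layer fires when the task description x arrives, before the interaction loop starts, and retrieves relevant skills from the skill memory bank S:

```
K_x = TopK_{k∈S} score(x, k)
```

Here score is implemented with BM25. The retrieved skills K_x are injected into the initial system prompt, giving the model **non-parametric decision guidance**.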
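
The retrieval step can be sketched in a few lines. This is a minimal illustration, not the Life-Harness implementation: the tokenizer, the `k1` and `b` defaults, the choice to drop zero-score skills, the prompt layout, and the names `retrieve_skills` and `build_system_prompt` are all assumptions made for the example. Scoring is plain Okapi BM25 with a smoothed IDF, written with the standard library only.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumed tokenizer: lowercase runs of word characters.
    return re.findall(r"\w+", text.lower())


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Return one Okapi BM25 score per document for the query."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs if n_docs else 0.0
    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))
    query_terms = set(tokenize(query))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def retrieve_skills(task, skills, top_k=2):
    """K_x = TopK over S of score(x, k); zero-score skills are dropped (assumption)."""
    scores = bm25_scores(task, skills)
    ranked = sorted(range(len(skills)), key=lambda i: scores[i], reverse=True)
    return [skills[i] for i in ranked[:top_k] if scores[i] > 0]


def build_system_prompt(base_prompt, retrieved):
    """Append retrieved skills to the initial system prompt (layout is assumed)."""
    if not retrieved:
        return base_prompt
    bullet_list = "\n".join(f"- {s}" for s in retrieved)
    return f"{base_prompt}\n\nRelevant skills:\n{bullet_list}"


if __name__ == "__main__":
    memory = [
        "To open a locked door, first find the key, then use the key on the door.",
        "To heat food, put it in the microwave and turn the microwave on.",
        "To clean an object, take it to the sink and turn on the faucet.",
    ]
    top = retrieve_skills("heat the potato with the microwave", memory, top_k=1)
    print(build_system_prompt("You are a household agent.", top))
```

Because the score is purely lexical, a skill is retrieved only when it shares terms with the task description, so how skills are worded directly affects what gets injected.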
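
## Definition of a skill

In Life-Harness, a Skill is a **compact, reusable strategy** that captures the core method for completing a specific subtask. Skills are distilled from training trajectories. Unlike model parameters, they need no gradient updates: they are stored and retrieved as text.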
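
As an illustration of storing skills as text rather than as weights, the sketch below keeps each skill as a small record and serializes the memory bank to JSON Lines. The record fields (`name`, `subtask`, `strategy`) and the JSON Lines format are assumptions for the example; the source only states that skills are stored and retrieved in text form.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class Skill:
    name: str      # hypothetical short identifier
    subtask: str   # hypothetical: the subtask this strategy addresses
    strategy: str  # the compact, reusable strategy in natural language


def dump_skills(skills):
    """Serialize skills as JSON Lines: one human-readable record per line."""
    return "\n".join(json.dumps(asdict(s), ensure_ascii=False) for s in skills)


def load_skills(text):
    """Parse JSON Lines back into Skill records, skipping blank lines."""
    return [Skill(**json.loads(line)) for line in text.splitlines() if line.strip()]


if __name__ == "__main__":
    bank = [
        Skill("heat-food", "heating", "Put the item in the microwave, then turn it on."),
        Skill("open-door", "navigation", "Find the key first, then use it on the door."),
    ]
    print(dump_skills(bank))
```

A plain-text format like this can be diffed, reviewed, and edited by hand, and updating the memory bank involves no training step.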
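
## Relationship to other layers

- Runs after the [[environment-contract-layer|Environment Contract Layer]]: the framework of valid behavior is established first, then strategy guidance is injected
- Complements the [[trajectory-regulation-layer|Trajectory Regulation Layer]]: the skill layer provides "positive guidance", the regulation layer provides "negative correction"
- Gives [[runtime-harness-adaptation|Runtime Harness Adaptation]] a non-parametric ability to learn from past experience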
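
The ordering described in the list above might be wired together roughly as follows. This is a speculative sketch: the section labels, the function names, and especially the `regulator` hook (a callback that may replace a proposed action) are placeholders invented for illustration, and the source does not specify how the other layers are implemented.

```python
from typing import Callable, List, Optional


def assemble_initial_prompt(contract: str, skills: List[str], task: str) -> str:
    """Contract section first, then skill guidance, then the task itself."""
    parts = ["[Environment contract]", contract]
    if skills:
        parts.append("[Suggested skills]")
        parts.extend(f"- {s}" for s in skills)
    parts.extend(["[Task]", task])
    return "\n".join(parts)


def run_episode(
    prompt: str,
    policy: Callable[[str, List[str]], str],
    regulator: Callable[[str, List[str]], Optional[str]],
    max_steps: int = 3,
) -> List[str]:
    """Interaction loop: skills steer via the prompt, the regulator corrects per step."""
    history: List[str] = []
    for _ in range(max_steps):
        action = policy(prompt, history)
        # Placeholder for "negative correction": None means the action is accepted.
        correction = regulator(action, history)
        history.append(action if correction is None else correction)
    return history


if __name__ == "__main__":
    prompt = assemble_initial_prompt(
        contract="Valid actions: go, take, use.",
        skills=["Find the key before trying the door."],
        task="Open the locked door.",
    )
    print(prompt)
```

The point of the sketch is only the division of labor: skill guidance is fixed once in the initial prompt, while correction happens step by step inside the loop.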
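
## Key properties

- **Auditable**: skills are stored in natural language, so humans can read and verify them directly
- **Cumulative**: new trajectories continually enrich the skill memory bank
- **Reusable across models**: skills are environment-side strategy knowledge and are not tied to a specific model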

## References

- [[xu-life-harness|Life-Harness paper]]
- [[harness-evolution|Harness Evolution]]
- [[cross-model-harness-transfer|Cross-Model Harness Transfer]]