20260601
This commit is contained in:
39
concepts/agent-symbolic-learning.md
Normal file
39
concepts/agent-symbolic-learning.md
Normal file
@@ -0,0 +1,39 @@
|
||||
---
|
||||
title: "Agent Symbolic Learning (Agent 符号学习)"
|
||||
created: 2026-05-29
|
||||
updated: 2026-05-29
|
||||
type: concept
|
||||
tags: ["agent", "symbolic-learning", "optimization", "self-evolving"]
|
||||
sources: ["https://arxiv.org/abs/2406.18532"]
|
||||
---
|
||||
|
||||
# Agent Symbolic Learning
|
||||
|
||||
**Agent Symbolic Learning** 是 Zhou et al. (AIWaves, 2024) 提出的框架:将 Agent pipeline 建模为 [[symbolic-network|符号网络]],用模仿连接主义学习(反向传播 + 梯度下降)的方式**联合优化** Agent 的所有符号组件。
|
||||
|
||||
## 核心类比
|
||||
|
||||
| 连接主义学习 | Agent Symbolic Learning |
|
||||
|:---|:---|
|
||||
| 计算图 | Agent Pipeline |
|
||||
| 层 + 权重 | 节点 + Prompts/Tools |
|
||||
| 数值 Loss | [[language-loss\|Language Loss]] |
|
||||
| 数值梯度 | [[language-gradient\|Language Gradients]] |
|
||||
| BP 链式法则 | [[symbolic-backpropagation\|Symbolic Back-Propagation]] |
|
||||
| 优化器 | Symbolic Optimizer (LLM) |
|
||||
|
||||
## 三阶段
|
||||
|
||||
1. **Forward Pass**: Agent 沿 pipeline 执行 → 轨迹
|
||||
2. **Backward Pass**: Language Loss 从末节点向前传播 → 每个节点的 Language Gradients
|
||||
3. **Weight Update**: Optimizer 根据 gradients 更新 prompts/tools/pipeline
|
||||
|
||||
## 历史意义
|
||||
|
||||
这是首次明确提出"模仿连接主义学习的反向传播和梯度下降来进行 Agent 符号优化"的工作。后续 [[yang-skillopt-2026|SkillOpt]](2026,Microsoft)在工程稳定性上深化,[[heuristic-learning|Heuristic Learning]](OpenAI)在范式层级上推广。
|
||||
|
||||
## 相关
|
||||
|
||||
- [[zhou-agent-symbolic-learning-2024]] — 原始论文
|
||||
- [[symbolic-network]] — 核心抽象
|
||||
- [[language-gradient]] — 核心机制
|
||||
Reference in New Issue
Block a user