Files
myWiki/concepts/agent-symbolic-learning.md
2026-06-01 10:46:01 +08:00

40 lines
1.6 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Agent Symbolic Learning (Agent 符号学习)"
created: 2026-05-29
updated: 2026-05-29
type: concept
tags: ["agent", "symbolic-learning", "optimization", "self-evolving"]
sources: ["https://arxiv.org/abs/2406.18532"]
---
# Agent Symbolic Learning
**Agent Symbolic Learning** 是 Zhou et al. (AIWaves, 2024) 提出的框架:将 Agent pipeline 建模为 [[symbolic-network|符号网络]],用模仿连接主义学习(反向传播 + 梯度下降)的方式**联合优化** Agent 的所有符号组件。
## 核心类比
| 连接主义学习 | Agent Symbolic Learning |
|:---|:---|
| 计算图 | Agent Pipeline |
| 层 + 权重 | 节点 + Prompts/Tools |
| 数值 Loss | [[language-loss\|Language Loss]] |
| 数值梯度 | [[language-gradient\|Language Gradients]] |
| BP 链式法则 | [[symbolic-backpropagation\|Symbolic Back-Propagation]] |
| 优化器 | Symbolic Optimizer (LLM) |
## 三阶段
1. **Forward Pass**: Agent 沿 pipeline 执行 → 轨迹
2. **Backward Pass**: Language Loss 从末节点向前传播 → 每个节点的 Language Gradients
3. **Weight Update**: Optimizer 根据 gradients 更新 prompts/tools/pipeline
## 历史意义
这是首次明确提出"模仿连接主义学习的反向传播和梯度下降来进行 Agent 符号优化"的工作。后续 [[yang-skillopt-2026|SkillOpt]]2026Microsoft在工程稳定性上深化[[heuristic-learning|Heuristic Learning]]OpenAI在范式层级上推广。
## 相关
- [[zhou-agent-symbolic-learning-2024]] — 原始论文
- [[symbolic-network]] — 核心抽象
- [[language-gradient]] — 核心机制