Files
myWiki/concepts/agent-harness-engineering.md
2026-06-01 10:46:01 +08:00

35 lines
1.3 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Agent Harness EngineeringAgent 执行骨架工程)"
created: 2026-05-23
updated: 2026-05-23
type: concept
tags: [agent, infrastructure, harness, production]
sources: [raw/papers/agent-harness-engineering-survey-2026.md]
confidence: high
---
# Agent Harness Engineering
> Agent Harness 是包裹 LLM 的基础设施层,负责管理长时间、多步骤任务执行的执行环境、工具接入、上下文、编排、可观测性、验证和治理。
## 核心定义
Agent Harness 不是 Agent 框架(开发工具),也不是 Agent 平台(产品 SaaS而是**使 Agent 可靠运行的底层控制平面**。它将执行、工具、上下文、编排、可观测性、验证和治理七个维度统一为一个工程表面。
## 为什么 Harness 比模型更关键?
- Bölük (2026a):仅改变 harness 格式就能同时提升 15 个 LLM 的编程能力
- Anthropic (2026a):基础设施设置可测量地改变 benchmark 分数
- 生产部署中,失败更多源自 harness 配置错误而非模型推理错误
## 相关概念
- [[etclovg-taxonomy]] — 七层分类体系
- [[binding-constraint-thesis]] — 约束瓶颈论
- [[prompt-to-harness-evolution]] — 三阶段工程演进
- [[harness-coupling-problem]] — 跨层耦合问题
## 参考论文
[[agent-harness-engineering-survey]]