20260617:目前有914 页

2026-06-17 15:02:40 +08:00
parent e96b955fda
commit 91fac5b6fc
423 changed files with 20687 additions and 34 deletions
--- a/concepts/openclaw.md
+++ b/concepts/openclaw.md
@@ -0,0 +1,38 @@
+---
+title: "OpenClaw"
+created: 2026-06-15
+updated: 2026-06-15
+type: concept
+tags: [coding-agent, multi-agent, tool-use]
+sources: [raw/papers/zheng-claw-swe-bench-2026.md]
+---
+
+# OpenClaw
+
+## 定义
+
+OpenClaw 是一个通用目的 agent，被广泛用作自主工具使用者——覆盖生产力工具、浏览器自动化、computer-use 任务和科学辅助。在 Claw-SWE-Bench 中，OpenClaw 作为**参考 claw**（reference harness），用于隔离 LLM 维度的变化。
+
+## 在 Claw-SWE-Bench 中的表现
+
+### 作为参考 claw（9 模型扫掠）
+| 模型 | OpenClaw Pass@1 |
+|------|-----------------|
+| GPT 5.5 | 78.0% |
+| Claude Opus 4.7 | 77.1% |
+| GLM 5.1 | 73.4% |
+| DeepSeek-V4 Pro | 71.7% |
+| DeepSeek-V4 Flash | 70.3% |
+| Qwen 3.6-flash | 66.0% |
+
+### 作为 claw 之一（5 claw 扫掠）
+在 GLM 5.1 上 OpenClaw 以 73.4% 排名第一，Hermes-Agent 以 71.1% 紧随其后。
+
+## 与 SWE-bench 的适配挑战
+
+OpenClaw 本身不满足 SWE-bench 的评分合约——它通常通过最终文本、结构化消息或内部日志发信号表示完成，而 SWE-bench evaluator 只读取 `model_patch` 字段。Claw-SWE-Bench 通过 adapter protocol 桥接这一 gap。
+
+## 参考
+- [[claw-swe-bench|Claw-SWE-Bench 论文]]
+- [[agent-harness|Agent Harness]]
+- [[adapter-protocol|适配器协议]]