SidneyZhang/myWiki

Files

Sidney Zhang 91fac5b6fc

20260617:目前有914 页

2026-06-17 15:02:40 +08:00

1.5 KiB

Raw Blame History

title, created, updated, type, tags, sources

title

created

updated

type

tags

sources

Information Flow Control

2026-06-05

2026-06-05

concept

agent-safety

multi-agent

communication

policy

liu-auditing-agent-harness-safety

Information Flow Control

信息流控制（Information Flow Control）是 agent-harness-safety中约束 Agent 之间信息共享的机制，对应 policy-constrained-execution 中的信息流策略 Φ。在 harnessaudit 的 L1 boundary-compliance 中作为三个审计通道之一。

问题本质

在 multi-agent-safety中，Agent 需要通信来协调任务——但通信引入了安全风险：一个 Agent 可能在正确的通信链路上传输了不该共享的内容。

HarnessAudit 定义的 Φ 策略包含：

允许的通信通道：哪些角色之间可以通信
允许的共享内容：哪些类型的信息可以跨越角色边界
禁止的披露：哪些内容不得对外暴露（如 PII、内部状态）

违规模式

HarnessAudit 实验揭示了两个关键发现：

Agent 能正确识别通信对象但无法控制共享内容——接收方判定准确但内容泄露频繁
信息流违规主要表现为敏感信息泄露而非授权接收方错误——这意味着骨架在通信路由层面表现尚可，但在内容过滤层面存在系统性缺陷

这与 agent-communication-stack的安全挑战一致——信息一旦通过通信接口发出，骨架就失去了对它的控制。