Files
myWiki/concepts/hallucination-mitigation.md

40 lines
1.3 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Hallucination Mitigation in LLM Systems"
type: concept
created: 2026-06-04
tags: [hallucination, llm, safety, faithfulness]
---
# Hallucination Mitigation幻觉抑制
**定义**:在 LLM 系统中抑制模型生成不忠实于输入或事实的信息的一组技术和方法。
## 方法分类
### 1. 检索锚定
- [[rag|RAG]]:通过外部检索提供事实基础
- [[content-grounded-retrieval]]:严格限定于提供的内容
### 2. 推理闸门
- [[sufficiency-check|充分性检查]]:评估证据是否充足后再生成
- Self-Consistency多路径采样投票
### 3. 训练时抑制
- [[rlhf|RLHF]]:通过人类偏好对齐减少编造
- [[dpo|DPO]]:直接偏好优化
- Factuality tuning事实性微调
### 4. 推理时控制
- 约束解码constrained decoding
- 引用强制citation enforcement
## IntrAgent 的贡献
[[intragent|IntrAgent]] 的 [[sufficiency-check|充分性检查]] 属于**推理闸门**类——在生成最终答案前显式评估信息充分性。与 RAG 的隐式可靠性不同,这是一个显式的质量控制步骤。
## 相关概念
- [[sufficiency-check]] — 充分性检查作为幻觉抑制闸门
- [[content-grounded-retrieval]] — 内容锚定抑制幻觉
- [[faithfulness-in-ai]] — AI 忠实性