Files
myWiki/concepts/memory-indexing-retrieval-reading.md

68 lines
2.0 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Memory Indexing-Retrieval-Reading Framework"
created: 2026-06-25
updated: 2026-06-25
type: concept
tags: ["memory", "architecture", "rag", "framework"]
sources:
- "[[longmem-eval-2025]]"
---
# Memory Indexing-Retrieval-Reading Framework
LongMemEval 提出的统一记忆设计框架,将长期记忆系统分解为三个阶段 × 四个控制点。
## 三阶段流水线
```
会话输入
[Indexing] → 存储结构化的记忆表示
[Retrieval] → 根据查询召回相关记忆
[Reading] → 基于检索结果生成准确答案
```
## 四个控制点
| 控制点 | 决策 | 优化方向 |
|--------|------|---------|
| **Value** | 存储什么粒度? | Session vs Round vs User Fact |
| **Key** | 用什么索引? | 原文 vs [[fact-augmented-key-expansion|事实增强]] |
| **Query** | 如何构造查询? | 原文 vs [[time-aware-query-expansion|时间感知展开]] |
| **Reading Strategy** | 如何利用检索结果? | 直接 vs Chain-of-Note + 结构化格式 |
## 实验发现的优化路径
### ValueRound 是最优粒度
- Session 级别:信息损失大,无法精确回溯
- Round 级别:每个用户消息为独立单元,最优平衡
- User Fact 级别:压缩导致信息损失,总体精度反降(但多会话推理提升)
### Key事实增强展开
用 LLM 从对话中提取结构化事实作为索引键 → 召回 +9.4%
### Query时间感知展开
关联时间戳 + 缩小搜索范围 → 时间推理召回 +6.8-11.3%
### ReadingChain-of-Note + 结构化
即使完美召回 ≠ 完美利用 → +10 个绝对百分点
## 与 Atlas 管线的映射
```
LongMemEval Atlas
─────────── ─────
Indexing → write_memory (episodic)
+ Key展开 → consolidation (→semantic)
Retrieval → recall_memory (BM25+dense)
Reading → LLM 利用检索结果生成回复
```
## 参考
- [[longmem-eval-2025]]
- [[fact-augmented-key-expansion]]
- [[time-aware-query-expansion]]
- [[atlas-memory-system]]