20260625:很多新内容
This commit is contained in:
67
concepts/memory-indexing-retrieval-reading.md
Normal file
67
concepts/memory-indexing-retrieval-reading.md
Normal file
@@ -0,0 +1,67 @@
|
||||
---
|
||||
title: "Memory Indexing-Retrieval-Reading Framework"
|
||||
created: 2026-06-25
|
||||
updated: 2026-06-25
|
||||
type: concept
|
||||
tags: ["memory", "architecture", "rag", "framework"]
|
||||
sources:
|
||||
- "[[longmem-eval-2025]]"
|
||||
---
|
||||
|
||||
# Memory Indexing-Retrieval-Reading Framework
|
||||
|
||||
LongMemEval 提出的统一记忆设计框架,将长期记忆系统分解为三个阶段 × 四个控制点。
|
||||
|
||||
## 三阶段流水线
|
||||
|
||||
```
|
||||
会话输入
|
||||
↓
|
||||
[Indexing] → 存储结构化的记忆表示
|
||||
↓
|
||||
[Retrieval] → 根据查询召回相关记忆
|
||||
↓
|
||||
[Reading] → 基于检索结果生成准确答案
|
||||
```
|
||||
|
||||
## 四个控制点
|
||||
|
||||
| 控制点 | 决策 | 优化方向 |
|
||||
|--------|------|---------|
|
||||
| **Value** | 存储什么粒度? | Session vs Round vs User Fact |
|
||||
| **Key** | 用什么索引? | 原文 vs [[fact-augmented-key-expansion|事实增强]] |
|
||||
| **Query** | 如何构造查询? | 原文 vs [[time-aware-query-expansion|时间感知展开]] |
|
||||
| **Reading Strategy** | 如何利用检索结果? | 直接 vs Chain-of-Note + 结构化格式 |
|
||||
|
||||
## 实验发现的优化路径
|
||||
|
||||
### Value:Round 是最优粒度
|
||||
- Session 级别:信息损失大,无法精确回溯
|
||||
- Round 级别:每个用户消息为独立单元,最优平衡
|
||||
- User Fact 级别:压缩导致信息损失,总体精度反降(但多会话推理提升)
|
||||
|
||||
### Key:事实增强展开
|
||||
用 LLM 从对话中提取结构化事实作为索引键 → 召回 +9.4%
|
||||
|
||||
### Query:时间感知展开
|
||||
关联时间戳 + 缩小搜索范围 → 时间推理召回 +6.8-11.3%
|
||||
|
||||
### Reading:Chain-of-Note + 结构化
|
||||
即使完美召回 ≠ 完美利用 → +10 个绝对百分点
|
||||
|
||||
## 与 Atlas 管线的映射
|
||||
|
||||
```
|
||||
LongMemEval Atlas
|
||||
─────────── ─────
|
||||
Indexing → write_memory (episodic)
|
||||
+ Key展开 → consolidation (→semantic)
|
||||
Retrieval → recall_memory (BM25+dense)
|
||||
Reading → LLM 利用检索结果生成回复
|
||||
```
|
||||
|
||||
## 参考
|
||||
- [[longmem-eval-2025]]
|
||||
- [[fact-augmented-key-expansion]]
|
||||
- [[time-aware-query-expansion]]
|
||||
- [[atlas-memory-system]]
|
||||
Reference in New Issue
Block a user