41 lines
1.3 KiB
Markdown
41 lines
1.3 KiB
Markdown
---
|
||
title: "Recursive Reasoning Models(递归推理模型)"
|
||
created: 2026-05-23
|
||
updated: 2026-05-23
|
||
type: concept
|
||
tags: [reasoning, recursive, latent, model-architecture]
|
||
sources: [raw/papers/gram-generative-recursive-reasoning-2026.md]
|
||
confidence: high
|
||
---
|
||
|
||
# Recursive Reasoning Models (RRM)
|
||
|
||
> 通过重复应用共享转移函数来精炼持久潜在状态——而非追加新元素到输出/推理序列——从而在紧凑模型上实现长距离推理。
|
||
|
||
## 核心思想
|
||
|
||
与自回归模型不同,RRM 将推理深度与参数规模、输出长度**解耦**:
|
||
- 小模型可以通过反复应用共享转移函数执行多步内部计算
|
||
- 不需要生成显式的推理 token(区别于 Chain-of-Thought)
|
||
|
||
## 代表性工作
|
||
|
||
- **HRM** (Hierarchical Recursive Models)
|
||
- **TRM** (Tree Recursive Models)
|
||
- 适用于约束传播、状态追踪、迭代校正、多步推理
|
||
|
||
## 确定性局限
|
||
|
||
现有 RRM 的关键缺陷:给定相同输入和初始化,它们遵循**单一潜在轨迹**,收敛到**唯一预测**。这意味着:
|
||
- 无法维持不确定性
|
||
- 无法探索多个解
|
||
- 单条精炼路径可能陷入次优
|
||
|
||
→ 这正是 [[gram-generative-recursive-reasoning|GRAM]] 要解决的问题。
|
||
|
||
## 相关概念
|
||
|
||
- [[stochastic-latent-trajectory]]
|
||
- [[deep-and-wide-reasoning]]
|
||
- [[gram-generative-recursive-reasoning-paper|GRAM 论文]]
|