20260601
This commit is contained in:
40
concepts/recursive-reasoning-models.md
Normal file
40
concepts/recursive-reasoning-models.md
Normal file
@@ -0,0 +1,40 @@
|
||||
---
|
||||
title: "Recursive Reasoning Models(递归推理模型)"
|
||||
created: 2026-05-23
|
||||
updated: 2026-05-23
|
||||
type: concept
|
||||
tags: [reasoning, recursive, latent, model-architecture]
|
||||
sources: [raw/papers/gram-generative-recursive-reasoning-2026.md]
|
||||
confidence: high
|
||||
---
|
||||
|
||||
# Recursive Reasoning Models (RRM)
|
||||
|
||||
> 通过重复应用共享转移函数来精炼持久潜在状态——而非追加新元素到输出/推理序列——从而在紧凑模型上实现长距离推理。
|
||||
|
||||
## 核心思想
|
||||
|
||||
与自回归模型不同,RRM 将推理深度与参数规模、输出长度**解耦**:
|
||||
- 小模型可以通过反复应用共享转移函数执行多步内部计算
|
||||
- 不需要生成显式的推理 token(区别于 Chain-of-Thought)
|
||||
|
||||
## 代表性工作
|
||||
|
||||
- **HRM** (Hierarchical Recursive Models)
|
||||
- **TRM** (Tree Recursive Models)
|
||||
- 适用于约束传播、状态追踪、迭代校正、多步推理
|
||||
|
||||
## 确定性局限
|
||||
|
||||
现有 RRM 的关键缺陷:给定相同输入和初始化,它们遵循**单一潜在轨迹**,收敛到**唯一预测**。这意味着:
|
||||
- 无法维持不确定性
|
||||
- 无法探索多个解
|
||||
- 单条精炼路径可能陷入次优
|
||||
|
||||
→ 这正是 [[gram-generative-recursive-reasoning|GRAM]] 要解决的问题。
|
||||
|
||||
## 相关概念
|
||||
|
||||
- [[stochastic-latent-trajectory]]
|
||||
- [[deep-and-wide-reasoning]]
|
||||
- [[gram-generative-recursive-reasoning-paper|GRAM 论文]]
|
||||
Reference in New Issue
Block a user