Files
myWiki/concepts/recursive-reasoning-models.md
2026-06-01 10:46:01 +08:00

41 lines
1.3 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Recursive Reasoning Models递归推理模型"
created: 2026-05-23
updated: 2026-05-23
type: concept
tags: [reasoning, recursive, latent, model-architecture]
sources: [raw/papers/gram-generative-recursive-reasoning-2026.md]
confidence: high
---
# Recursive Reasoning Models (RRM)
> 通过重复应用共享转移函数来精炼持久潜在状态——而非追加新元素到输出/推理序列——从而在紧凑模型上实现长距离推理。
## 核心思想
与自回归模型不同RRM 将推理深度与参数规模、输出长度**解耦**
- 小模型可以通过反复应用共享转移函数执行多步内部计算
- 不需要生成显式的推理 token区别于 Chain-of-Thought
## 代表性工作
- **HRM** (Hierarchical Recursive Models)
- **TRM** (Tree Recursive Models)
- 适用于约束传播、状态追踪、迭代校正、多步推理
## 确定性局限
现有 RRM 的关键缺陷:给定相同输入和初始化,它们遵循**单一潜在轨迹**,收敛到**唯一预测**。这意味着:
- 无法维持不确定性
- 无法探索多个解
- 单条精炼路径可能陷入次优
→ 这正是 [[gram-generative-recursive-reasoning|GRAM]] 要解决的问题。
## 相关概念
- [[stochastic-latent-trajectory]]
- [[deep-and-wide-reasoning]]
- [[gram-generative-recursive-reasoning-paper|GRAM 论文]]