SidneyZhang/myWiki

Files

Sidney Zhang e96b955fda

20260601

2026-06-01 10:46:01 +08:00

1.3 KiB

Raw Blame History

title, created, updated, type, tags, sources, confidence

title

created

updated

type

tags

sources

confidence

Recursive Reasoning Models（递归推理模型）

2026-05-23

2026-05-23

concept

reasoning

recursive

latent

model-architecture

raw/papers/gram-generative-recursive-reasoning-2026.md

high

Recursive Reasoning Models (RRM)

通过重复应用共享转移函数来精炼持久潜在状态——而非追加新元素到输出/推理序列——从而在紧凑模型上实现长距离推理。

核心思想

与自回归模型不同，RRM 将推理深度与参数规模、输出长度解耦：

小模型可以通过反复应用共享转移函数执行多步内部计算
不需要生成显式的推理 token（区别于 Chain-of-Thought）

代表性工作

HRM (Hierarchical Recursive Models)
TRM (Tree Recursive Models)
适用于约束传播、状态追踪、迭代校正、多步推理

确定性局限

现有 RRM 的关键缺陷：给定相同输入和初始化，它们遵循单一潜在轨迹，收敛到唯一预测。这意味着：

无法维持不确定性
无法探索多个解
单条精炼路径可能陷入次优

→ 这正是 gram-generative-recursive-reasoning 要解决的问题。

相关概念