42 lines
1.2 KiB
Markdown
42 lines
1.2 KiB
Markdown
---
|
||
title: "Multi-Trajectory Inference(多轨迹推理)"
|
||
created: 2026-05-23
|
||
updated: 2026-05-23
|
||
type: concept
|
||
tags: [reasoning, multi-trajectory, inference, parallel]
|
||
sources: [raw/papers/gram-generative-recursive-reasoning-2026.md]
|
||
confidence: medium
|
||
---
|
||
|
||
# Multi-Trajectory Inference
|
||
|
||
> GRAM 的核心推理范式:不沿单一路径精炼,而是采样**多条潜在推理轨迹**,在轨迹间维持多个假设和替代策略。
|
||
|
||
## 为什么需要多轨迹
|
||
|
||
现有确定性 [[recursive-reasoning-models|RRM]] 的关键失败模式:
|
||
- 单一路径可能陷入次优吸引子
|
||
- 无法同时探索多个有效解
|
||
- 推理路径空间被坍缩为单点
|
||
|
||
## GRAM 的实现
|
||
|
||
- 推理过程 = 从 p(z_T|x) 采样
|
||
- 每次采样产生一条完整轨迹
|
||
- 多条轨迹 -> p(y|x) 的蒙特卡洛估计
|
||
- 轨迹间可并行(天然适合 batch)
|
||
|
||
## 与多解问题的关系
|
||
|
||
在 [[multi-solution-recovery|多解恢复]] 场景中,多轨迹推理天然优势:
|
||
- 不同轨迹收敛到不同有效解
|
||
- 一次运行恢复整个解集
|
||
- 无需顺序尝试
|
||
|
||
## 相关概念
|
||
|
||
- [[stochastic-latent-trajectory]]
|
||
- [[inference-time-scaling]]
|
||
- [[width-based-scaling]]
|
||
- [[gram-generative-recursive-reasoning|GRAM]]
|