Files
myWiki/concepts/width-based-scaling.md
2026-06-01 10:46:01 +08:00

39 lines
1.1 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Width-Based Scaling宽度扩展"
created: 2026-05-23
updated: 2026-05-23
type: concept
tags: [inference, scaling, width, parallel]
sources: [raw/papers/gram-generative-recursive-reasoning-2026.md]
confidence: medium
---
# Width-Based Scaling
> GRAM 引入的新扩展维度:通过增加并行采样的潜在推理轨迹数量来提升推理性能,而不增加模型大小或序列长度。
## 工作原理
- 从 [[stochastic-latent-trajectory]] 分布中采样 K 条轨迹
- K 条轨迹可以**完全并行**运行(天然 batch
- 最终预测 = 聚合 K 条轨迹的结果
## 宽度 vs 深度
- **深度**:单条轨迹的推理质量(精炼程度)
- **宽度**:轨迹覆盖的多样性(探索广度)
- 两者正交,可以独立调参
## 与 Ensemble 的区别
GRAM 的宽度扩展 != 传统 Ensemble
- Ensemble 需要多个独立模型
- GRAM 的宽度 = 同一模型的多个随机实现
- 单一模型参数,多条推理路径
## 相关概念
- [[inference-time-scaling]]
- [[multi-trajectory-inference]]
- [[deep-and-wide-reasoning]]