SidneyZhang/myWiki

Files

Sidney Zhang 91fac5b6fc

20260617:目前有914 页

2026-06-17 15:02:40 +08:00

1.7 KiB

Raw Blame History

title, created, updated, type, tags, sources

title

created

updated

type

tags

sources

Itemic-Text 对齐 (Itemic-Text Alignment)

2026-06-10

2026-06-10

concept

recommendation

alignment

cross-modal

perception

raw/papers/onereason-team-onereason-2026.md

Itemic-Text 对齐 (Itemic-Text Alignment)

将 itemic-tokens 与自然语言 token 在语义空间中深度对齐，是为推荐系统解锁推理能力的前提条件。

问题

generative-recommendation 中的 itemic token 是「语义空洞」的——模型只能学习表面的 token 转移模式，无法理解 token 背后代表的商品/视频/直播的语义内容。

解决方案

OneReason 在 Pre-training 阶段强化 itemic-text 对齐：

Coarse-to-fine 对齐语料：收集复杂多样的对齐预训练语料
跨模态语义空间：将新增的 itemic token 与 text token 在同一语义空间中对齐
四粒度预训练：不同粒度（token/item/sequence/session）的对齐样本

为什么对齐是推理的前提

借鉴多模态 LLM 的研究发现：

Alignment Failure 模式：当视觉和文本模态未充分对齐时，模型会机械地「读取」表面视觉文本而非真正推理底层语义
Perception→Cognition 路径：高级交互式推理无法在建立稳健跨模态感知对齐之前涌现

在 OneReason 框架中的位置

Itemic-text 对齐是 perception-cognition-recommendation 的核心，为 R1-R3 各层推理提供语义基础。

参考