SidneyZhang/myWiki

Files

Sidney Zhang 91fac5b6fc

20260617:目前有914 页

2026-06-17 15:02:40 +08:00

1.8 KiB

Raw Blame History

title, created, updated, type, tags, sources

title

created

updated

type

tags

sources

感知-认知推荐层次 (R0-R3)

2026-06-10

2026-06-10

concept

recommendation

reasoning

perception

cognition

hierarchy

raw/papers/onereason-team-onereason-2026.md

感知-认知推荐层次 (R0-R3)

OneReason 提出的推荐推理四层递进体系：Perception → Derivation → Evolution → Recommendation。

层次结构

R0: Perception（感知）

基础能力：将 itemic-tokens 扎根到其显式语义内容中。

Item Understanding：给定 item，生成自然语言描述
Itemic Pattern Grounding：给定描述，定位到对应 item
Item QA：基于 item 内容的问答

没有 R0，用户行为完全不可解读。

R1: Derivation（推导）

从单个 item 语义推理 item-to-item 关系。

通过常识/知识关联从噪声交互历史中提取潜在兴趣
任务：Item2Item 关联匹配

R2: Evolution（演化）

将同一潜在兴趣的 item 建模为时序过程。

捕捉长期/短期/周期性偏好
任务：演化行为选择、演化主题生成、演化链直接生成

R3: Recommendation（推荐）

在所有服务域中连贯推理，产出推荐决策。

单域推荐（视频/商品/广告/直播）
跨域推荐

设计哲学

推荐推理是 abductive-reasoning-recommendation 而非演绎——从行为反推隐含兴趣点。R0-R3 层次将这一过程中每个诊断能力独立评估和训练。

与 OneReason-Bench 的关系

onereason-bench 的评测体系直接对应 R0-R3 层次，作为训练各阶段的测量协议。

参考