20260617:目前有914 页
This commit is contained in:
46
concepts/recommendation-cot.md
Normal file
46
concepts/recommendation-cot.md
Normal file
@@ -0,0 +1,46 @@
|
||||
---
|
||||
title: "推荐思维链 (Recommendation CoT)"
|
||||
created: 2026-06-10
|
||||
updated: 2026-06-10
|
||||
type: concept
|
||||
tags: [recommendation, chain-of-thought, reasoning]
|
||||
sources: [raw/papers/onereason-team-onereason-2026.md]
|
||||
---
|
||||
|
||||
# 推荐思维链 (Recommendation CoT)
|
||||
|
||||
> 专门为推荐任务设计的 Chain-of-Thought 推理格式,区别于通用 LLM 的 CoT。
|
||||
|
||||
## 与通用 CoT 的区别
|
||||
|
||||
通用 [[chain-of-thought|思维链 (CoT)]] 面向数学/代码/逻辑等有唯一正确答案的任务。推荐 CoT 需要处理:
|
||||
|
||||
- **多解输出**:多个 item 都可以是合理推荐
|
||||
- **隐含意图**:用户意图不可观测,需从 [[itemic-tokens|itemic token]] 序列推断
|
||||
- **时序演化**:兴趣随时间变化,需建模长期/短期/周期偏好
|
||||
|
||||
## OneReason 的三层 CoT 结构
|
||||
|
||||
OneReason 提出 cognition-enhanced CoT,包含三个层次:
|
||||
|
||||
### R1: Derivation(推导)
|
||||
从单个 item 语义推理 item-to-item 关系,通过常识或知识关联从噪声交互历史中提取潜在兴趣。
|
||||
|
||||
### R2: Evolution(演化)
|
||||
将同一潜在兴趣的 item 建模为时序过程,捕捉长期、短期和周期性偏好。
|
||||
|
||||
### R3: Recommendation(推荐)
|
||||
在所有服务域(短视频、直播等)中连贯推理,产出高质量推荐决策。
|
||||
|
||||
## CoT 鲁棒性的两个条件
|
||||
|
||||
借鉴多模态 LLM 研究,推荐 CoT 的有效性依赖:
|
||||
1. **[[itemic-text-alignment|跨模态对齐]]**(感知层面)
|
||||
2. **粗到细的推理格式**(认知层面)
|
||||
|
||||
## 参考
|
||||
|
||||
- [[chain-of-thought|思维链 (CoT)]]
|
||||
- [[onereason|OneReason]]
|
||||
- [[recommendation-reasoning|推荐推理]]
|
||||
- [[perception-cognition-recommendation|感知-认知推荐层次]]
|
||||
Reference in New Issue
Block a user