20260625:很多新内容
This commit is contained in:
37
concepts/unlimited-ocr.md
Normal file
37
concepts/unlimited-ocr.md
Normal file
@@ -0,0 +1,37 @@
|
||||
---
|
||||
title: "Unlimited OCR 模型"
|
||||
created: 2026-06-24
|
||||
updated: 2026-06-24
|
||||
type: concept
|
||||
tags: ["ocr", "attention-mechanism", "long-horizon", "end-to-end", "baidu"]
|
||||
sources:
|
||||
- "[[unlimited-ocr-works-2026]]"
|
||||
---
|
||||
|
||||
# Unlimited OCR
|
||||
|
||||
Unlimited OCR 是百度提出的端到端长程 OCR 模型。以 DeepSeek OCR 为基线,将所有 decoder 注意力层替换为 R-SWA,实现恒定 KV cache + 恒定推理速度。
|
||||
|
||||
## 架构
|
||||
|
||||
- 继承 DeepEncoder(16× 压缩,冻结训练)
|
||||
- Decoder:3B MoE,激活 500M,全部注意力替换为 R-SWA
|
||||
- 训练:4000 步,8×16 A800,32K 序列长度,DeepEP EP=4
|
||||
|
||||
## 核心性能
|
||||
|
||||
- OmniDocBench v1.5:93.23%(+6.22pp over DeepSeek OCR)
|
||||
- 2-40+ 页长程解析:一次前向
|
||||
- 推理 TPS 恒定,6000 token 时领先 35%
|
||||
|
||||
## 认知启发
|
||||
|
||||
人类长程抄写时只关注附近上下文,不回溯全部历史。R-SWA 的 soft forgetting 与此一致。
|
||||
|
||||
## 参考
|
||||
- [[unlimited-ocr-works-2026]]
|
||||
- [[reference-sliding-window-attention]]
|
||||
- [[deepseek-ocr]]
|
||||
- [[deepencoder]]
|
||||
- [[constant-kv-cache]]
|
||||
- [[long-horizon-parsing]]
|
||||
Reference in New Issue
Block a user