Files
myWiki/concepts/million-token-context.md

36 lines
1.1 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
title: "Million-Token Context"
domain: "Machine Learning / Long-Context Models"
tags: [long-context, efficiency, inference, kv-cache]
sources: [[deepseek-v4-million-token-context]]
---
# Million-Token Context
> **类型**: Concept (Tier 3 — Placeholder)
> **来源**: [[deepseek-v4-million-token-context]]
## 概述
百万 Token 上下文是指语言模型能够高效处理的序列长度达到 1,000,000 个 token。这是 DeepSeek-V4 系列的核心突破——通过 [[hybrid-attention-architecture]] 等技术创新,实现了在百万 token 上下文下仅为 DeepSeek-V3.2 27%Pro或 10%Flash的推理 FLOPs。
## 关键技术
- [[compressed-sparse-attention]] + [[heavily-compressed-attention]] 混合注意力
- [[fp4-quantization-training]] FP4 量化
- 异构 KV Cache 与磁盘存储策略
## 核心内容
*此页面为占位符,用于修复 wiki 中的断链。详细内容待后续补充。*
## 相关概念
- [[hybrid-attention-architecture]] — 混合注意力架构
- [[test-time-scaling]] — 测试时扩展
---
*Last Updated: 2026-04-27*
*Status: Placeholder — to be completed*