276 lines
16 KiB
Markdown
276 lines
16 KiB
Markdown
# Wiki Log
|
||
|
||
> 所有 wiki 操作的按时间顺序记录。仅追加。
|
||
> 格式:`## [YYYY-MM-DD] action | subject`
|
||
> 操作类型:ingest, update, query, lint, create, archive, delete
|
||
> 当此文件超过 500 条记录时,轮换:重命名为 log-YYYY.md,重新开始。
|
||
|
||
## [2026-04-27] ingest | DeepSeek-V4 技术报告 (HuggingFace)
|
||
- 来源:https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main/DeepSeek_V4.pdf
|
||
- 作者:DeepSeek-AI
|
||
- PDF:4.4MB,提取 4906 行文本
|
||
- 新增文件 (14 个):
|
||
- `raw/papers/deepseek-ai-deepseek-v4-2026.md` — 原始论文存档
|
||
- `papers/deepseek-v4-million-token-context.md` — 论文主页面
|
||
- Tier 1 核心概念 (5 个):
|
||
- `concepts/compressed-sparse-attention.md` — CSA 压缩稀疏注意力
|
||
- `concepts/heavily-compressed-attention.md` — HCA 高强度压缩注意力
|
||
- `concepts/manifold-constrained-hyper-connections.md` — mHC 流形约束超连接
|
||
- `concepts/muon-optimizer.md` — Muon 优化器
|
||
- `concepts/on-policy-distillation.md` — OPD 在线策略蒸馏
|
||
- Tier 2 基础概念 (4 个):
|
||
- `concepts/hybrid-attention-architecture.md` — 混合注意力架构
|
||
- `concepts/mixture-of-experts.md` — MoE 混合专家
|
||
- `concepts/fp4-quantization-training.md` — FP4 量化感知训练
|
||
- `concepts/specialist-training-pipeline.md` — 专家训练流水线
|
||
- Tier 3 占位符概念 (3 个):
|
||
- `concepts/multi-token-prediction.md` — MTP 多 Token 预测
|
||
- `concepts/test-time-scaling.md` — 测试时扩展
|
||
- `concepts/million-token-context.md` — 百万 Token 上下文
|
||
- 关键概念:CSA/HCA 混合注意力、mHC 双随机矩阵约束、Muon 优化器、OPD 多教师蒸馏
|
||
- 更新 index.md:总页面数 57 → 71
|
||
|
||
## [2026-04-20] merge | 合并 /home/ubuntu/wiki 到 /home/ubuntu/wikiplace
|
||
- 来源:旧 wiki 路径(默认回退路径 ~/wiki)
|
||
- 操作:将 wiki 独有的文件合并到 wikiplace
|
||
- 新增文件:
|
||
- `concepts/computerized-adaptive-testing.md` — CAT 测试综述
|
||
- `concepts/cramer-rao-lower-bound.md` — CRLB 参数估计下界
|
||
- `concepts/knowledge-bank.md` — AI 辅助开发知识管理系统
|
||
- `concepts/symbolic-regression.md` — 符号回归技术
|
||
- `raw/articles/knowledge-bank-ai-dev-2026.md` — Knowledge Bank 微信公众号原文
|
||
- `raw/papers/hbs-cramerrao-bound-notes.md` — HBS CRLB 培训材料摘要
|
||
- `raw/papers/zhuang-catsurvey-ml-2024.md` — CAT 综述论文元数据
|
||
- `raw/papers/cramerrao-bound-notes.pdf` — HBS CRLB 培训 PDF
|
||
- `raw/papers/odrzywolek-eml-universal-operator-2026.pdf` — EML 论文 PDF
|
||
- 合并更新:
|
||
- `concepts/eml-operator.md` — 补充了符号回归联系、布尔逻辑类比、研究意义和更多开放问题
|
||
- `entities/andrzej-odrzywolek.md` — 补充了发表文献、发现方法、重要意义和外部链接
|
||
- 更新 index.md:总页面数 24 → 28
|
||
- 更新 log.md:追加合并记录
|
||
|
||
## [2025-04-15] create | Wiki 初始化
|
||
- 领域:数学研究、AI/ML 研究、编程技术、学习笔记与阅读资料
|
||
- 创建结构:SCHEMA.md, index.md, log.md
|
||
- 目录结构:raw/, entities/, concepts/, comparisons/, queries/
|
||
|
||
## [2025-04-15] ingest | Mathematical methods and human thought in the age of AI
|
||
- 来源:arXiv:2603.26524
|
||
- 作者:[[Terence Tao]], [[Tanya Klowden]]
|
||
- 保存至:raw/papers/tao-ai-mathematical-methods-2026.md
|
||
- 创建页面:
|
||
- entities/papers/tao-klowden-ai-mathematical-methods.md
|
||
- entities/terence-tao.md
|
||
- entities/tanya-klowden.md
|
||
- concepts/human-centered-ai.md
|
||
- concepts/formal-verification.md
|
||
- concepts/ai-mathematics.md
|
||
- 更新 index.md:总页面数 6
|
||
|
||
## [2026-04-16] ingest | All elementary functions from a single binary operator
|
||
- 来源:arXiv:2603.21852 [cs.SC]
|
||
- 作者:[[Andrzej Odrzywołek]]
|
||
- 保存至:raw/papers/odrzywolek-eml-single-operator-2026.md
|
||
- 创建页面:
|
||
- papers/odrzywolek-eml-single-operator.md — EML 算子论文摘要
|
||
- entities/andrzej-odrzywolek.md — 作者实体页面
|
||
- concepts/eml-operator.md — EML 算子概念页面
|
||
- 更新 index.md:总页面数 9
|
||
- 关键概念:EML Sheffer 算子、二叉树语法、符号回归、连续数学完备性
|
||
|
||
## [2026-04-19] ingest | Memory Caching: RNNs with Growing Memory
|
||
- 来源:arXiv:2602.24281 [cs.LG]
|
||
- 作者:Ali Behrouz, Zeman Li, Yuan Deng, Peilin Zhong, Meisam Razaviyayn, Vahab Mirrokni
|
||
- 保存至:raw/papers/behrouz-memory-caching-rnn-2026.md
|
||
- 创建页面:
|
||
- papers/behrouz-memory-caching-rnn.md — MC 论文笔记
|
||
- concepts/memory-caching-rnn.md — Memory Caching 技术详解
|
||
- concepts/subquadratic-transformer-alternatives.md — 次二次 Transformer 替代方案综述
|
||
- 更新 index.md:总页面数 12
|
||
- 关键概念:Memory Caching、RNN 增长记忆、次二次复杂度、隐藏状态缓存、门控聚合
|
||
|
||
## [2026-04-19] ingest | "Are You Sure?": Human Perception Vulnerability in LLM Agents
|
||
- 来源:arXiv:2602.21127 [cs.HC]
|
||
- 作者:Xinfeng Li, Shenyu Dai, Kelong Zheng, Yue Xiao, Gelei Deng, Wei Dong, Xiaofeng Wang
|
||
- 保存至:raw/papers/li-amd-human-perception-2026.md
|
||
- 创建页面:
|
||
- papers/li-amd-human-perception.md — AMD 实证研究论文笔记
|
||
- concepts/agent-mediated-deception.md — AMD 攻击模式详解
|
||
- concepts/human-agent-trust.md — 人机信任与脆弱性
|
||
- 更新 index.md:总页面数 14
|
||
- 关键概念:Agent-Mediated Deception、HAT-Lab、认知失败模式、经验学习、信任校准
|
||
|
||
## [2026-04-19] ingest | Prefill-as-a-Service: KVCache Goes Cross-Datacenter
|
||
- 来源:arXiv:2604.15039 [cs.DC]
|
||
- 作者:Ruoyu Qin, Weiran He, Yaoyu Wang, Zheming Li, Xinran Xu, Yongwei Wu, Weimin Zheng, Mingxing Zhang
|
||
- 保存至:raw/papers/qin-prfaas-cross-datacenter-2026.md
|
||
- 创建页面:
|
||
- papers/qin-prfaas-cross-datacenter.md — PrfaaS 论文笔记
|
||
- concepts/prefill-as-a-service.md — PrfaaS 架构详解
|
||
- concepts/prefill-decode-disaggregation.md — PD 分离架构演进
|
||
- concepts/kvcache-transfer.md — KVCache 传输与优化
|
||
- 更新 index.md:总页面数 17
|
||
- 关键概念:Prefill-as-a-Service、跨数据中心部署、KVCache 传输、混合注意力、带宽感知调度
|
||
|
||
## [2026-04-19] ingest | Mixture-of-Depths Attention (MoDA)
|
||
- 来源:arXiv:2603.15619 [cs.LG]
|
||
- 作者:Lianghui Zhu, Yuxin Fang, Bencheng Liao, Shijie Wang, Tianheng Cheng, Zilong Huang, Chen Chen, Lai Wei, Yutao Zeng, Ya Wang, Yi Lin, Yu Li, Xinggang Wang
|
||
- 保存至:raw/papers/zhu-moda-mixture-of-depths-2026.md
|
||
- 创建页面:
|
||
- papers/zhu-moda-mixture-of-depths.md — MoDA 论文笔记
|
||
- concepts/mixture-of-depths-attention.md — MoDA 机制详解
|
||
- concepts/depth-scaling-signal-degradation.md — 深度扩展与信号退化问题
|
||
- 更新 index.md:总页面数 21
|
||
- 关键概念:Mixture-of-Depths Attention、信号退化、跨层 KV 访问、硬件高效实现、Post-Norm 优势
|
||
|
||
## [2026-04-19] ingest | OPPO 多模态数据湖实践 (WeChat Article)
|
||
- 来源:微信公众号文章 (DataFun / Data for AI Meetup)
|
||
- 分享人:David (OPPO 大数据架构负责人)
|
||
- 链接:https://mp.weixin.qq.com/s/cBaYa04qAIGsxG1hD7ll3w
|
||
- 保存至:raw/articles/oppo-multimodal-data-lake-2026.md
|
||
- 创建页面:
|
||
- articles/oppo-multimodal-data-lake.md — 文章核心架构与成果总结
|
||
- concepts/gravitino-unified-metadata.md — Gravitino 统一元数据管理
|
||
- concepts/curvine-distributed-cache.md — Curvine 分布式缓存系统
|
||
- 更新 index.md:新增 Articles 分区,总页面数 24
|
||
- 关键概念:多模态数据湖、Gravitino 元数据、Curvine 缓存、LanceDB 加速、混合云架构
|
||
|
||
## [2026-04-20] ingest | Spurious Predictability in Financial Machine Learning
|
||
- 来源:arXiv:2604.15531 [q-fin.ST, stat.ME, stat.ML]
|
||
- 作者:Sotirios D. Nikolopoulos
|
||
- 保存至:raw/papers/nikolopoulos-spurious-predictability-2026.md
|
||
- 创建页面:
|
||
- papers/nikolopoulos-spurious-predictability.md — 金融机器学习虚假可预测性论文笔记
|
||
- concepts/spurious-predictability.md — 虚假可预测性概念详解
|
||
- 更新 index.md:总页面数 30
|
||
|
||
## [2026-04-20] ingest | Hyperagents: Self-Referential Agents with Metacognitive Self-Modification
|
||
- 来源:arXiv:2603.19461 [cs.AI]
|
||
- 作者:Jenny Zhang, Bingchen Zhao, Wannan Yang, Jakob Foerster, Jeff Clune, Minqi Jiang, Sam Devlin, Tatiana Shavrina
|
||
- 保存至:raw/papers/zhang-hyperagents-2026.md
|
||
- 创建页面:
|
||
- papers/zhang-hyperagents.md — 超智能体论文笔记
|
||
- concepts/hyperagents.md — 超智能体概念详解
|
||
- concepts/self-improving-ai.md — 自我改进人工智能概念
|
||
- concepts/darwin-godel-machine.md — 达尔文·哥德尔机概念
|
||
- concepts/metacognitive-self-modification.md — 元认知自我修改概念
|
||
- 更新 index.md:总页面数 35
|
||
- 关键概念:超智能体、自我改进 AI、达尔文·哥德尔机、元认知自我修改、自我加速进展、可编辑元级
|
||
|
||
## [2026-04-20] fix | 修复超智能体相关概念的断链
|
||
- 修复问题:新创建页面中存在指向未创建概念的链接
|
||
- 创建缺失概念页面:
|
||
- concepts/meta-learning.md — 元学习概念
|
||
- concepts/recursive-self-improvement.md — 递归自我改进概念
|
||
- concepts/genetic-programming.md — 遗传编程概念
|
||
- concepts/program-synthesis.md — 程序合成概念
|
||
- concepts/cognitive-architecture.md — 认知架构概念
|
||
- concepts/singularity.md — 技术奇点概念
|
||
- 创建占位符概念页面(修复剩余断链):
|
||
- concepts/ai-alignment.md — AI 对齐概念
|
||
- concepts/ai-safety.md — AI 安全概念
|
||
- concepts/neuroscience.md — 神经科学概念
|
||
- concepts/evolutionary-algorithms.md — 进化算法概念
|
||
- concepts/few-shot-learning.md — 少样本学习概念
|
||
- concepts/transfer-learning.md — 迁移学习概念
|
||
- 更新 index.md:总页面数 46
|
||
- 修复效果:消除所有新页面中的断链,建立完整的概念网络
|
||
- 关键概念:虚假可预测性、证伪审计、选择诱导性能膨胀、有效多重性、金融机器学习方法论
|
||
|
||
## [2026-04-22] ingest | ClawLess: A Security Model of AI Agents
|
||
- 来源:arXiv:2604.06284v1 [cs.CR]
|
||
- 作者:Hongyi Lu, Nian Liu, Shuai Wang, Fengwei Zhang
|
||
- 机构:南方科技大学,香港科技大学
|
||
- 保存至:raw/papers/lu-hongyi-clawless-ai-agent-security-2026.md
|
||
- 创建页面:
|
||
- papers/clawless-ai-agent-security.md — ClawLess 论文笔记
|
||
- concepts/clawless.md — ClawLess 安全框架概念
|
||
- concepts/ai-agent-security.md — AI 代理安全概念
|
||
- concepts/formal-security-model.md — 形式化安全模型概念
|
||
- concepts/userspace-kernel.md — 用户空间内核概念
|
||
- concepts/bpf-syscall-interception.md — BPF系统调用拦截概念
|
||
- concepts/secure-containers.md — 安全容器概念
|
||
- concepts/worst-case-threat-model.md — 最坏情况威胁模型概念
|
||
- 更新 index.md:总页面数 46 → 53
|
||
- 关键概念:ClawLess、AI代理安全、形式化安全模型、用户空间内核、BPF系统调用拦截、安全容器、最坏情况威胁模型
|
||
|
||
## [2026-04-22] ingest | Crawl4AI: 开源智能网页爬虫与数据提取工具
|
||
- 来源:知乎专栃 https://zhuanlan.zhihu.com/p/717965307
|
||
- 作者:沈飞
|
||
- 保存至:raw/articles/shenfei-crawl4ai-open-source-web-crawler-2024.md
|
||
- 创建页面:
|
||
- articles/crawl4ai-open-source-web-crawler.md — Crawl4AI 文章主页面
|
||
- concepts/crawl4ai.md — Crawl4AI 工具概念页面
|
||
- concepts/rag-systems.md — RAG 系统概念页面
|
||
- concepts/llm-applications.md — LLM 应用概念页面
|
||
- 更新 index.md:总页面数 53 → 57
|
||
- 关键概念:Crawl4AI、网页爬虫、数据提取、RAG、LLM应用、Markdown转换
|
||
|
||
---
|
||
|
||
## 2026-04-28 | 哥德尔不完备定理教程
|
||
|
||
- **来源**: PDF 直接提交 (godel_tutorial.pdf),2026年4月综合教程
|
||
- **作者**: 无明确单一作者(面向数学系本科生的教学资料)
|
||
- **新增页面**: 25 个(1 论文 + 1 原始存档 + 23 概念)
|
||
- raw/papers/godel-tutorial-2026.md — 原始存档
|
||
- papers/godel-incompleteness-tutorial.md — 论文主页面
|
||
- concepts/godel-incompleteness-theorems.md — 哥德尔不完备定理
|
||
- concepts/godel-numbering.md — 哥德尔编码
|
||
- concepts/hilberts-program.md — 希尔伯特计划
|
||
- concepts/peano-arithmetic.md — 皮亚诺算术
|
||
- concepts/self-reference.md — 自指
|
||
- concepts/diagonalization-method.md — 对角线方法
|
||
- concepts/halting-problem.md — 停机问题
|
||
- concepts/lucas-penrose-argument.md — 卢卡斯-彭罗斯论证
|
||
- concepts/chaitin-algorithmic-information-theory.md — 算法信息论
|
||
- concepts/metamathematics.md — 元数学
|
||
- concepts/primitive-recursive-functions.md — 原始递归函数
|
||
- concepts/computability-theory.md — 可计算性理论
|
||
- concepts/formal-systems.md — 形式系统
|
||
- concepts/automated-theorem-proving.md — 自动定理证明
|
||
- concepts/paris-harrington-theorem.md — 巴黎-哈灵顿定理
|
||
- concepts/goodsteins-theorem.md — 古德斯坦定理
|
||
- concepts/russells-paradox.md — 罗素悖论
|
||
- concepts/continuum-hypothesis.md — 连续统假设
|
||
- concepts/consistency-logic.md — 一致性
|
||
- concepts/completeness-logic.md — 完备性
|
||
- concepts/mathematical-pluralism.md — 数学多元主义
|
||
- concepts/chaitin-constant.md — 蔡廷常数
|
||
- concepts/kolmogorov-complexity.md — 柯尔莫哥洛夫复杂度
|
||
- 更新 index.md:总页面数 71 → 96
|
||
- 关键概念:哥德尔不完备定理、哥德尔编码、自指、对角线方法、停机问题、希尔伯特计划、可计算性、形式系统
|
||
## [2026-04-29] ingest | 大语言模型注意力机制全面分析 (综述论文)
|
||
- 来源:用户直接上传 PDF (LLM注意力机制全面分析.pdf)
|
||
- 类型:综述论文 / Review Paper,2026年4月
|
||
- PDF:1385 行文本提取
|
||
- 新增文件 (21 个):
|
||
- `raw/papers/llm-attention-survey-2026.md` — 原始论文存档
|
||
- `papers/llm-attention-survey-2026.md` — 论文主页面
|
||
- Tier 1 核心概念 (6 个):
|
||
- `concepts/multi-head-attention.md` — MHA 标准多头注意力
|
||
- `concepts/grouped-query-attention.md` — GQA 分组查询注意力
|
||
- `concepts/multi-head-latent-attention.md` — MLA 多潜在头注意力
|
||
- `concepts/flash-attention.md` — FlashAttention IO感知优化
|
||
- `concepts/attention-entropy-collapse.md` — 注意力熵崩溃
|
||
- `concepts/kv-cache-bottleneck.md` — KV缓存内存瓶颈
|
||
- Tier 2 基础概念 (5 个):
|
||
- `concepts/multi-query-attention.md` — MQA 多查询注意力
|
||
- `concepts/sparse-attention-patterns.md` — 稀疏注意力模式
|
||
- `concepts/linear-attention-methods.md` — 线性注意力方法
|
||
- `concepts/rotary-position-embedding.md` — RoPE 旋转位置编码
|
||
- `concepts/lost-in-the-middle.md` — Lost in the Middle 现象
|
||
- Tier 3 占位概念 (8 个):
|
||
- `concepts/attention-sinks.md` — 注意力汇
|
||
- `concepts/flash-attention-3.md` — FlashAttention-3
|
||
- `concepts/mamba-ssm.md` — Mamba 状态空间模型
|
||
- `concepts/mixture-of-attention-schemes.md` — MoAS 注意力方案混合
|
||
- `concepts/duo-attention.md` — DuoAttention 双模式注意力
|
||
- `concepts/seer-attention.md` — SeerAttention 可学习稀疏
|
||
- `concepts/ntk-aware-interpolation.md` — NTK-aware 位置插值
|
||
- `concepts/native-sparse-attention.md` — NSA 原生稀疏注意力
|
||
- 更新 index.md:总页面数 96 → 116
|
||
- 关键概念:注意力机制演化谱系 (MHA→MQA→GQA→MLA)、FlashAttention、注意力退化、KV缓存瓶颈、Lost in the Middle
|
||
- 网络连接:与已有概念 CSA、HCA、混合注意力架构、DeepSeek-V4 等形成密集交叉引用
|
||
|