20260518-morning: add new content

2026-05-18 10:17:56 +08:00
parent b116710e4c
commit 2faf4bb002
38 changed files with 1528 additions and 2 deletions

@@ -1,10 +1,11 @@
# LLM Wiki
> Knowledge index page — auto-generated
> Last updated: 2026-05-14 | Total pages: 300
> Last updated: 2026-05-15 | Total pages: 335
## Concepts
- [[adaptive-computation-time]] — Family of techniques that adjust compute dynamically according to input difficulty (ACT, PonderNet, etc.)
- [[additive-combinatorics]]
- [[agent-communication-stack]]
- [[agent-mediated-deception]]
@@ -17,20 +18,25 @@
- [[ai-alignment]]
- [[ai-mathematics]]
- [[ai-safety]]
- [[analytical-report-synthesizer]] — LLM-driven generator that automatically turns prediction results into analytical reports
- [[api-key-authentication]]
- [[attention-entropy-collapse]]
- [[attention-sinks]]
- [[automated-theorem-proving]]
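- [[backtranslation-round-trip-relay]] — Backtranslation relay: evaluates LLM document-editing fidelity through chains of reversible edits
- [[base-table-embedding]] — DIME stage 1: dual-path encoding that captures intra-table semantics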
- [[behrouz-memory-caching-rnn]]
- [[bidirectional-trajectory-evaluation]]
- [[bpf-syscall-interception]]
- [[cache-health-observability]]
- [[cache-hit-ratio]]
- [[cache-invalidation]]
- [[cache-safe-forking]]
- [[caddy-reverse-proxy-auth]]
- [[caddy-web-server]]
- [[cel-shading-style]]
- [[centralized-agent-architecture]]
- [[certainty-based-loss]] — Native adaptive computation via dual tick selection: argmin(loss) + argmax(certainty)
- [[certainty-based-rewards]]
- [[chain-of-thought]]
- [[chaitin-algorithmic-information-theory]]
@@ -38,12 +44,15 @@
- [[cl-bench-life]]
- [[classifier-free-guidance-language]] — Applying CFG to language diffusion models
- [[clawless]]
- [[clawless-ai-agent-security]]
- [[coarse-grained-counting]]
- [[cognitive-architecture]]
- [[completeness-logic]]
- [[composable-base-model-architecture]] — Composable modeling framework: a pool of base models plus shared components
- [[compressed-sparse-attention]]
- [[computability-theory]]
- [[computerized-adaptive-testing]]
- [[conditional-model-dispatcher]] — Dispatcher for model selection and conditional augmentation, driven by ZCP + historical EMA
- [[confidence-correctness-alignment]]
- [[consistency-logic]]
- [[context-blue-clique]]
@@ -51,27 +60,38 @@
- [[context-learning]]
- [[context-misuse]]
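- [[continuous-diffusion-language-models]] — Diffusion language models in continuous embedding space
- [[continuous-thought-machine]] — CTM: a new architecture whose core computational principles are neural temporal dynamics and synchronization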
- [[continuum-hypothesis]]
- [[cramer-rao-lower-bound]]
- [[crawl4ai]]
- [[crawl4ai-open-source-web-crawler]]
- [[critical-failures]] — Critical failures: sparse but severe errors that account for about 80% of document degradation
- [[curvine-distributed-cache]]
- [[darlow-ctm-2025]] — CTM: Continuous Thought Machine, using neural synchronization as representation (NeurIPS 2025)
- [[darwin-godel-machine]]
- [[data-slice]] — Task-specific subset of a relational database; DIME's core data object
- [[decentralized-agent-architecture]]
- [[deepseek-v4-flash]]
- [[deepseek-v4-million-token-context]]
- [[deepseek-vit]]
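- [[delegate-52]] — Microsoft benchmark: 310 work environments × 52 professional domains, evaluating LLM readiness for delegated work
- [[delegated-work]] — Delegated work: an emerging LLM interaction paradigm in which the user supervises a model that completes tasks on their behalf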
- [[depth-scaling-signal-degradation]]
- [[diagonal-ramsey-number]]
- [[diagonalization-method]]
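- [[dime-dynamic-in-database-modeling-engine]] — DIME: NeurIDA's core dynamic modeling engine
- [[discrete-diffusion-language-models]] — Diffusion language models in discrete token space
- [[distractor-context]] — Distractor context: topically related documents that need no editing, simulating imperfect retrieval precision
- [[document-degradation]] — Document degradation: LLMs silently corrupting document content during long delegated workflows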
- [[domain-knowledge-reasoning]]
- [[domain-specific-evaluation]] — Domain-specific evaluation: an evaluation method with a custom parser and semantic-equivalence scoring for each domain
- [[dou-cl-bench]]
- [[duo-attention]]
- [[dynamic-in-database-modeling]] — New paradigm that assembles custom models from shared components at query time
- [[dynamic-mode-decomposition]]
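- [[dynamic-model-fusion]] — Context-aware module for selective relation fusion
- [[dynamic-relation-modeling]] — Message passing that is aware of cross-table relational structure
- [[elf-embedded-language-flows]] — ELF: Flow Matching language diffusion model in continuous embedding space (2026)
- [[embedded-language-flows]] — ELF: continuous-embedding flow-matching language model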
- [[eml-operator]]
- [[empirical-discovery-simulation]]
@@ -96,9 +116,11 @@
- [[geometric-ramsey-theory]]
- [[glitch-art-style]]
- [[godel-incompleteness-theorems]]
- [[godel-incompleteness-tutorial]]
- [[godel-numbering]]
- [[goodsteins-theorem]]
- [[gpt-image2]]
- [[gpt-image2-prompt-collection]]
- [[gravitino-unified-metadata]]
- [[greedy-context-screening]]
- [[green-tao-theorem]]
@@ -106,15 +128,20 @@
- [[grouped-query-attention]]
- [[halftone-print-style]]
- [[halting-problem]]
- [[he-urlvr-sharpening-2026]]
- [[heavily-compressed-attention]]
- [[hilberts-program]]
- [[human-agent-trust]]
- [[human-centered-ai]]
- [[hunyuan-team-cl-bench-life]]
- [[hybrid-attention-architecture]]
- [[hyperagents]]
- [[hypergraph-ramsey-number]]
- [[identity-reference-resolution]]
- [[image-generation-prompt-design]]
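- [[in-database-analytics]] — Methodology for running ML/analytics tasks directly inside the DBMS
- [[internal-ticks]] — Internal timeline decoupled from the data dimensions; the dimension along which CTM's "thinking steps" unfold
- [[internal-world-model]] — Representation of the environment built internally by an agent; emerges in CTM maze tasks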
- [[intrinsic-rewards-sharpening]]
- [[jagged-frontier]] — Jagged frontier: the uneven, unpredictable distribution of AI model capabilities across domains
- [[klein-blue]]
@@ -125,10 +152,15 @@
- [[koopman-theory]]
- [[kv-cache-bottleneck]]
- [[kvcache-transfer]]
- [[laban-llms-corrupt-documents-delegate]] — "LLMs Corrupt Your Documents When You Delegate" — DELEGATE-52
- [[length-extrapolation]] — Length extrapolation: enabling LLMs to handle sequences longer than the pretraining window
- [[li-amd-human-perception]]
- [[linear-attention-methods]]
- [[liu-koopa-2023]]
- [[llm-applications]]
- [[llm-attention-survey-2026]]
- [[llm-evaluation-benchmarks]]
- [[log]] — Changelog
- [[long-context-understanding]]
- [[long-horizon-evaluation]] — Long-horizon evaluation: extended interactions reveal degradation patterns that short evaluations miss
- [[lost-in-the-middle]]
@@ -156,15 +188,23 @@
- [[multimodal-large-language-model]]
- [[muon-optimizer]]
- [[native-sparse-attention]]
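- [[neural-synchronization]] — Uses the temporal correlations of neuron activation histories directly as the latent representation
- [[neurida]] — Neural In-Database Analytics: an autonomous end-to-end in-database analytics system
- [[neuron-level-models]] — Each neuron has an MLP with private parameters, replacing a uniform activation function
- [[neuron-pairing]] — Subsampling strategy for the O(D²) synchronization matrix, balancing efficiency and expressiveness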
- [[neuroscience]]
- [[nikolopoulos-spurious-predictability]]
- [[non-stationary-time-series]]
- [[ntk-aware-interpolation]]
- [[odrzywolek-eml-single-operator]]
- [[on-policy-distillation]]
- [[oppo-multimodal-data-lake]]
- [[paley-graph]]
- [[paris-harrington-theorem]]
- [[path-tracing]]
- [[peano-arithmetic]]
- [[perception-gap]]
- [[pre-activation-history]] — Rolling buffer of pre-activations maintained per neuron; the input to the NLM
- [[prefill-as-a-service]]
- [[prefill-decode-disaggregation]]
- [[prefix-matching]]
@@ -173,21 +213,28 @@
- [[procedural-task-execution]]
- [[program-synthesis]]
- [[prompt-caching]]
- [[prompt-caching-architecture]]
- [[prompt-layering]]
- [[prompt-reverse-engineering]]
- [[qin-prfaas-cross-datacenter]]
- [[query-intent-analyzer]] — LLM-driven NLQ parser that outputs structured task/data profiles
- [[rag-systems]]
- [[ramsey-context-cache]]
- [[ramsey-context-construction]]
- [[ramsey-context-graph]]
- [[ramsey-context-template]]
- [[ramsey-numbers]]
- [[ramsey-numbers-survey]]
- [[ramsey-theory]]
- [[ramsey-theory-applications]]
- [[random-graph-theory]]
- [[README]] — About the wiki
- [[real-life-context-learning]]
- [[rectified-flows]] — Straight-line interpolation paths in Flow Matching
- [[recursive-self-improvement]]
- [[reference-gap]]
- [[reinforcement-learning-trading]]
- [[relational-graph]] — Graph of tuples with FK-PK links as edges; the data-structure foundation for relational modeling
- [[reverse-proxy-authentication]]
- [[reward-hacking-llm]]
- [[reward-model]]
@@ -199,6 +246,7 @@
- [[rule-system-application]]
- [[russells-paradox]]
- [[russian-constructivism]]
- [[SCHEMA]] — Wiki structure specification
- [[sde-sampler-language]] — Stochastic differential equation samplers for language diffusion
- [[secure-containers]]
- [[seer-attention]]
@@ -211,19 +259,27 @@
- [[singularity]]
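- [[sink-token]] — Learnable sink token: a dedicated token added during pretraining to serve as the sole attention sink
- [[softmax-off-by-one]] — SoftMax₁: a SoftMax variant that allows excess attention to be discarded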
- [[song-agent-network-taxonomy]]
- [[sparse-attention-patterns]]
- [[specialist-training-pipeline]]
- [[specialized-rl]]
- [[specialized-sft]]
- [[spiking-neural-networks]] — Biologically inspired neural networks that use discrete spikes and event-driven computation
- [[spurious-predictability]]
- [[streaming-llm]] — StreamingLLM: attention-sink-based inference framework for infinite-length streaming language models (ICLR 2024)
- [[stub-pattern]]
- [[subquadratic-transformer-alternatives]]
- [[symbolic-regression]]
- [[synapse-model]] — CTM's U-Net-style recurrent synapse structure; the engine for sharing information between neurons
- [[system-2-thinking]]
- [[system-message-abuse]]
- [[szemerédi-regularity-lemma]]
- [[tabular-foundation-models]] — Foundation models pretrained on large-scale tabular data (TabPFN, TabICL, etc.)
- [[tao-klowden-ai-mathematical-methods]]
- [[temporal-decay-neural]] — Learnable exponential decay parameter per neuron pair, controlling the time scale of synchronization
- [[test-time-scaling]]
- [[test-time-training-rl]]
- [[thinking-with-visual-primitives]]
- [[time-variant-dynamics]]
- [[token-efficiency]]
- [[tool-registry]]
@@ -236,11 +292,19 @@
- [[window-attention]] — Window attention: a naive scheme that caches only the most recent tokens and collapses because it evicts the attention sinks
- [[worst-case-threat-model]]
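- [[x-prediction-parameterization]] — Parameterization in Flow Matching that predicts the clean data directly
- [[xing-trails-2024]] — Trails: database-native model selection for deep neural networks (VLDB 2024)
- [[zeng-dynamic-model-slicing-2024]] — Dynamic model slicing inside the database (VLDB 2024)
- [[zeng-neurida-2025]] — NeurIDA: dynamic in-database modeling for effective relational database analytics
- [[zero-cost-proxies]] — NAS techniques that estimate model performance without full training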
- [[zhang-hyperagents]]
- [[zhao-neurdb-2025]] — NeurDB: an AI-powered autonomous database (CIDR 2025)
- [[zhu-moda-mixture-of-depths]]
## Papers
- [[behrouz-memory-caching-rnn]]
- [[clawless-ai-agent-security]]
- [[darlow-ctm-2025]] — CTM: Continuous Thought Machine, using neural synchronization as representation (NeurIPS 2025)
- [[deepseek-v4-million-token-context]]
- [[dou-cl-bench]]
- [[elf-embedded-language-flows]] — ELF: Flow Matching language diffusion model in continuous embedding space (2026)
@@ -259,7 +323,11 @@
- [[streaming-llm]] — StreamingLLM: attention-sink-based inference framework for infinite-length streaming language models (ICLR 2024)
- [[tao-klowden-ai-mathematical-methods]]
- [[thinking-with-visual-primitives]]
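- [[xing-trails-2024]] — Trails: database-native model selection for deep neural networks (VLDB 2024)
- [[zeng-dynamic-model-slicing-2024]] — Dynamic model slicing inside the database (VLDB 2024)
- [[zeng-neurida-2025]] — NeurIDA: dynamic in-database modeling for effective relational database analytics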
- [[zhang-hyperagents]]
- [[zhao-neurdb-2025]] — NeurDB: an AI-powered autonomous database (CIDR 2025)
- [[zhu-moda-mixture-of-depths]]
## Articles
@@ -275,4 +343,8 @@
- [[SCHEMA]] — Wiki structure specification
- [[log]] — Changelog
- [[README]] — About the wiki
- [[README]] — About the wiki
## Reviews
- [[ctm-review-20260515]] — Review of the CTM paper integration (2026-05-15)