20260518-morning:新增内容
This commit is contained in:
76
index.md
76
index.md
@@ -1,10 +1,11 @@
|
||||
# LLM Wiki
|
||||
|
||||
> 知识索引页面 — 自动生成
|
||||
> 最后更新:2026-05-14 | 总页面数:300
|
||||
> 最后更新:2026-05-15 | 总页面数:335
|
||||
|
||||
## Concepts
|
||||
|
||||
- [[adaptive-computation-time]] — 根据输入难度动态调整计算量的技术族(ACT, PonderNet 等)
|
||||
- [[additive-combinatorics]]
|
||||
- [[agent-communication-stack]]
|
||||
- [[agent-mediated-deception]]
|
||||
@@ -17,20 +18,25 @@
|
||||
- [[ai-alignment]]
|
||||
- [[ai-mathematics]]
|
||||
- [[ai-safety]]
|
||||
- [[analytical-report-synthesizer]] — LLM 驱动的预测结果→分析报告自动生成器
|
||||
- [[api-key-authentication]]
|
||||
- [[attention-entropy-collapse]]
|
||||
- [[attention-sinks]]
|
||||
- [[automated-theorem-proving]]
|
||||
- [[backtranslation-round-trip-relay]] — 回译接力:通过可逆编辑链评估 LLM 文档编辑保真度
|
||||
- [[base-table-embedding]] — DIME 第一阶段:双路径编码捕获表内语义
|
||||
- [[behrouz-memory-caching-rnn]]
|
||||
- [[bidirectional-trajectory-evaluation]]
|
||||
- [[bpf-syscall-interception]]
|
||||
- [[cache-health-observability]]
|
||||
- [[cache-hit-ratio]]
|
||||
- [[cache-invalidation]]
|
||||
- [[cache-safe-forking]]
|
||||
- [[caddy-reverse-proxy-auth]]
|
||||
- [[caddy-web-server]]
|
||||
- [[cel-shading-style]]
|
||||
- [[centralized-agent-architecture]]
|
||||
- [[certainty-based-loss]] — 通过 argmin(loss) + argmax(certainty) 双 tick 选择实现原生自适应计算
|
||||
- [[certainty-based-rewards]]
|
||||
- [[chain-of-thought]]
|
||||
- [[chaitin-algorithmic-information-theory]]
|
||||
@@ -38,12 +44,15 @@
|
||||
- [[cl-bench-life]]
|
||||
- [[classifier-free-guidance-language]] — CFG 在语言扩散模型中的应用
|
||||
- [[clawless]]
|
||||
- [[clawless-ai-agent-security]]
|
||||
- [[coarse-grained-counting]]
|
||||
- [[cognitive-architecture]]
|
||||
- [[completeness-logic]]
|
||||
- [[composable-base-model-architecture]] — 基础模型池 + 共享组件的可组合建模框架
|
||||
- [[compressed-sparse-attention]]
|
||||
- [[computability-theory]]
|
||||
- [[computerized-adaptive-testing]]
|
||||
- [[conditional-model-dispatcher]] — ZCP + 历史 EMA 驱动的模型选择与条件增强调度器
|
||||
- [[confidence-correctness-alignment]]
|
||||
- [[consistency-logic]]
|
||||
- [[context-blue-clique]]
|
||||
@@ -51,27 +60,38 @@
|
||||
- [[context-learning]]
|
||||
- [[context-misuse]]
|
||||
- [[continuous-diffusion-language-models]] — 连续嵌入空间中的扩散语言模型
|
||||
- [[continuous-thought-machine]] — CTM:以神经时序动态和同步为核心计算原理的新架构
|
||||
- [[continuum-hypothesis]]
|
||||
- [[cramer-rao-lower-bound]]
|
||||
- [[crawl4ai]]
|
||||
- [[crawl4ai-open-source-web-crawler]]
|
||||
- [[critical-failures]] — 关键失败:稀疏但严重的错误解释了约80%的文档退化
|
||||
- [[curvine-distributed-cache]]
|
||||
- [[darlow-ctm-2025]] — CTM: 以神经同步为表示的持续思考机器 (NeurIPS 2025)
|
||||
- [[darwin-godel-machine]]
|
||||
- [[data-slice]] — 任务特定的关系数据库子集,DIME 的核心数据对象
|
||||
- [[decentralized-agent-architecture]]
|
||||
- [[deepseek-v4-flash]]
|
||||
- [[deepseek-v4-million-token-context]]
|
||||
- [[deepseek-vit]]
|
||||
- [[delegate-52]] — Microsoft 基准:310工作环境 × 52专业领域,评估LLM委托工作就绪性
|
||||
- [[delegated-work]] — 委托工作:新兴LLM交互范式,用户监督模型代其完成任务
|
||||
- [[depth-scaling-signal-degradation]]
|
||||
- [[diagonal-ramsey-number]]
|
||||
- [[diagonalization-method]]
|
||||
- [[dime-dynamic-in-database-modeling-engine]] — DIME:NeurIDA 的核心动态建模引擎
|
||||
- [[discrete-diffusion-language-models]] — 离散 token 空间中的扩散语言模型
|
||||
- [[distractor-context]] — 干扰上下文:话题相关但无需编辑的文档,模拟不完美检索精度
|
||||
- [[document-degradation]] — 文档退化:LLM在长委托工作流中静默破坏文档内容的现象
|
||||
- [[domain-knowledge-reasoning]]
|
||||
- [[domain-specific-evaluation]] — 领域特定评估:每个领域自定义解析器和语义等价评分的评估方法
|
||||
- [[dou-cl-bench]]
|
||||
- [[duo-attention]]
|
||||
- [[dynamic-in-database-modeling]] — 从共享组件在查询时装配定制模型的新范式
|
||||
- [[dynamic-mode-decomposition]]
|
||||
- [[dynamic-model-fusion]] — 上下文感知的选择性关系融合模块
|
||||
- [[dynamic-relation-modeling]] — 跨表关系结构感知的消息传递
|
||||
- [[elf-embedded-language-flows]] — ELF: 连续嵌入空间中的 Flow Matching 语言扩散模型 (2026)
|
||||
- [[embedded-language-flows]] — ELF: 连续嵌入流匹配语言模型
|
||||
- [[eml-operator]]
|
||||
- [[empirical-discovery-simulation]]
|
||||
@@ -96,9 +116,11 @@
|
||||
- [[geometric-ramsey-theory]]
|
||||
- [[glitch-art-style]]
|
||||
- [[godel-incompleteness-theorems]]
|
||||
- [[godel-incompleteness-tutorial]]
|
||||
- [[godel-numbering]]
|
||||
- [[goodsteins-theorem]]
|
||||
- [[gpt-image2]]
|
||||
- [[gpt-image2-prompt-collection]]
|
||||
- [[gravitino-unified-metadata]]
|
||||
- [[greedy-context-screening]]
|
||||
- [[green-tao-theorem]]
|
||||
@@ -106,15 +128,20 @@
|
||||
- [[grouped-query-attention]]
|
||||
- [[halftone-print-style]]
|
||||
- [[halting-problem]]
|
||||
- [[he-urlvr-sharpening-2026]]
|
||||
- [[heavily-compressed-attention]]
|
||||
- [[hilberts-program]]
|
||||
- [[human-agent-trust]]
|
||||
- [[human-centered-ai]]
|
||||
- [[hunyuan-team-cl-bench-life]]
|
||||
- [[hybrid-attention-architecture]]
|
||||
- [[hyperagents]]
|
||||
- [[hypergraph-ramsey-number]]
|
||||
- [[identity-reference-resolution]]
|
||||
- [[image-generation-prompt-design]]
|
||||
- [[in-database-analytics]] — 在 DBMS 内部直接执行 ML/分析任务的方法论
|
||||
- [[internal-ticks]] — 与数据维度解耦的内部时序,CTM 的「思考步骤」展开维度
|
||||
- [[internal-world-model]] — agent 内部构建的环境表征,在 CTM 迷宫任务中涌现
|
||||
- [[intrinsic-rewards-sharpening]]
|
||||
- [[jagged-frontier]] — 锯齿前沿:AI模型能力在不同领域间不均衡、不可预测的分布
|
||||
- [[klein-blue]]
|
||||
@@ -125,10 +152,15 @@
|
||||
- [[koopman-theory]]
|
||||
- [[kv-cache-bottleneck]]
|
||||
- [[kvcache-transfer]]
|
||||
- [[laban-llms-corrupt-documents-delegate]] — "LLMs Corrupt Your Documents When You Delegate" — DELEGATE-52
|
||||
- [[length-extrapolation]] — 长度外推:让 LLM 处理超出预训练窗口的序列长度
|
||||
- [[li-amd-human-perception]]
|
||||
- [[linear-attention-methods]]
|
||||
- [[liu-koopa-2023]]
|
||||
- [[llm-applications]]
|
||||
- [[llm-attention-survey-2026]]
|
||||
- [[llm-evaluation-benchmarks]]
|
||||
- [[log]] — 变更日志
|
||||
- [[long-context-understanding]]
|
||||
- [[long-horizon-evaluation]] — 长视界评估:通过延长交互揭示短评估中不可见的退化模式
|
||||
- [[lost-in-the-middle]]
|
||||
@@ -156,15 +188,23 @@
|
||||
- [[multimodal-large-language-model]]
|
||||
- [[muon-optimizer]]
|
||||
- [[native-sparse-attention]]
|
||||
- [[neural-synchronization]] — 将神经元激活历史的时序相关性直接用作潜在表示
|
||||
- [[neurida]] — Neural In-Database Analytics:自主端到端库内分析系统
|
||||
- [[neuron-level-models]] — 每个神经元拥有私有参数的 MLP,替代统一激活函数
|
||||
- [[neuron-pairing]] — 对 O(D²) 同步矩阵的子采样策略,用于效率与表达力平衡
|
||||
- [[neuroscience]]
|
||||
- [[nikolopoulos-spurious-predictability]]
|
||||
- [[non-stationary-time-series]]
|
||||
- [[ntk-aware-interpolation]]
|
||||
- [[odrzywolek-eml-single-operator]]
|
||||
- [[on-policy-distillation]]
|
||||
- [[oppo-multimodal-data-lake]]
|
||||
- [[paley-graph]]
|
||||
- [[paris-harrington-theorem]]
|
||||
- [[path-tracing]]
|
||||
- [[peano-arithmetic]]
|
||||
- [[perception-gap]]
|
||||
- [[pre-activation-history]] — 每个神经元维护的滚动前激活缓冲区,NLM 的输入
|
||||
- [[prefill-as-a-service]]
|
||||
- [[prefill-decode-disaggregation]]
|
||||
- [[prefix-matching]]
|
||||
@@ -173,21 +213,28 @@
|
||||
- [[procedural-task-execution]]
|
||||
- [[program-synthesis]]
|
||||
- [[prompt-caching]]
|
||||
- [[prompt-caching-architecture]]
|
||||
- [[prompt-layering]]
|
||||
- [[prompt-reverse-engineering]]
|
||||
- [[qin-prfaas-cross-datacenter]]
|
||||
- [[query-intent-analyzer]] — LLM 驱动的 NLQ 解析器,输出结构化任务/数据画像
|
||||
- [[rag-systems]]
|
||||
- [[ramsey-context-cache]]
|
||||
- [[ramsey-context-construction]]
|
||||
- [[ramsey-context-graph]]
|
||||
- [[ramsey-context-template]]
|
||||
- [[ramsey-numbers]]
|
||||
- [[ramsey-numbers-survey]]
|
||||
- [[ramsey-theory]]
|
||||
- [[ramsey-theory-applications]]
|
||||
- [[random-graph-theory]]
|
||||
- [[README]] — Wiki 说明
|
||||
- [[real-life-context-learning]]
|
||||
- [[rectified-flows]] — Flow Matching 中的直线插值路径
|
||||
- [[recursive-self-improvement]]
|
||||
- [[reference-gap]]
|
||||
- [[reinforcement-learning-trading]]
|
||||
- [[relational-graph]] — 以 FK-PK 为边的元组图,关系建模的数据结构基础
|
||||
- [[reverse-proxy-authentication]]
|
||||
- [[reward-hacking-llm]]
|
||||
- [[reward-model]]
|
||||
@@ -199,6 +246,7 @@
|
||||
- [[rule-system-application]]
|
||||
- [[russells-paradox]]
|
||||
- [[russian-constructivism]]
|
||||
- [[SCHEMA]] — Wiki 结构规范
|
||||
- [[sde-sampler-language]] — 语言扩散中的随机微分方程采样器
|
||||
- [[secure-containers]]
|
||||
- [[seer-attention]]
|
||||
@@ -211,19 +259,27 @@
|
||||
- [[singularity]]
|
||||
- [[sink-token]] — 可学习汇 Token:预训练时添加专用 Token 作为唯一注意力汇
|
||||
- [[softmax-off-by-one]] — SoftMax₁:允许丢弃多余注意力的 SoftMax 变体
|
||||
- [[song-agent-network-taxonomy]]
|
||||
- [[sparse-attention-patterns]]
|
||||
- [[specialist-training-pipeline]]
|
||||
- [[specialized-rl]]
|
||||
- [[specialized-sft]]
|
||||
- [[spiking-neural-networks]] — 使用离散脉冲和事件驱动计算的生物启发神经网络
|
||||
- [[spurious-predictability]]
|
||||
- [[streaming-llm]] — StreamingLLM: 基于注意力汇的无限长流式语言模型推理框架 (ICLR 2024)
|
||||
- [[stub-pattern]]
|
||||
- [[subquadratic-transformer-alternatives]]
|
||||
- [[symbolic-regression]]
|
||||
- [[synapse-model]] — CTM 的 U-Net 风格循环突触结构,神经元间信息共享引擎
|
||||
- [[system-2-thinking]]
|
||||
- [[system-message-abuse]]
|
||||
- [[szemerédi-regularity-lemma]]
|
||||
- [[tabular-foundation-models]] — 大规模表格数据预训练的基础模型(TabPFN, TabICL 等)
|
||||
- [[tao-klowden-ai-mathematical-methods]]
|
||||
- [[temporal-decay-neural]] — 每对神经元可学习的指数衰减参数,控制同步的时间尺度
|
||||
- [[test-time-scaling]]
|
||||
- [[test-time-training-rl]]
|
||||
- [[thinking-with-visual-primitives]]
|
||||
- [[time-variant-dynamics]]
|
||||
- [[token-efficiency]]
|
||||
- [[tool-registry]]
|
||||
@@ -236,11 +292,19 @@
|
||||
- [[window-attention]] — 窗口注意力:仅缓存最近 Token 的朴素方案,因驱逐注意力汇而崩溃
|
||||
- [[worst-case-threat-model]]
|
||||
- [[x-prediction-parameterization]] — Flow Matching 中直接预测干净数据的参数化
|
||||
- [[xing-trails-2024]] — Trails: 数据库原生的深度神经网络模型选择 (VLDB 2024)
|
||||
- [[zeng-dynamic-model-slicing-2024]] — 数据库内的动态模型切片技术 (VLDB 2024)
|
||||
- [[zeng-neurida-2025]] — NeurIDA: 动态库内建模实现有效的关系数据库分析
|
||||
- [[zero-cost-proxies]] — 无需完整训练即可估计模型性能的 NAS 技术
|
||||
- [[zhang-hyperagents]]
|
||||
- [[zhao-neurdb-2025]] — NeurDB: AI 驱动的自主数据库 (CIDR 2025)
|
||||
- [[zhu-moda-mixture-of-depths]]
|
||||
|
||||
## Papers
|
||||
|
||||
- [[behrouz-memory-caching-rnn]]
|
||||
- [[clawless-ai-agent-security]]
|
||||
- [[darlow-ctm-2025]] — CTM: 以神经同步为表示的持续思考机器 (NeurIPS 2025)
|
||||
- [[deepseek-v4-million-token-context]]
|
||||
- [[dou-cl-bench]]
|
||||
- [[elf-embedded-language-flows]] — ELF: 连续嵌入空间中的 Flow Matching 语言扩散模型 (2026)
|
||||
@@ -259,7 +323,11 @@
|
||||
- [[streaming-llm]] — StreamingLLM: 基于注意力汇的无限长流式语言模型推理框架 (ICLR 2024)
|
||||
- [[tao-klowden-ai-mathematical-methods]]
|
||||
- [[thinking-with-visual-primitives]]
|
||||
- [[xing-trails-2024]] — Trails: 数据库原生的深度神经网络模型选择 (VLDB 2024)
|
||||
- [[zeng-dynamic-model-slicing-2024]] — 数据库内的动态模型切片技术 (VLDB 2024)
|
||||
- [[zeng-neurida-2025]] — NeurIDA: 动态库内建模实现有效的关系数据库分析
|
||||
- [[zhang-hyperagents]]
|
||||
- [[zhao-neurdb-2025]] — NeurDB: AI 驱动的自主数据库 (CIDR 2025)
|
||||
- [[zhu-moda-mixture-of-depths]]
|
||||
|
||||
## Articles
|
||||
@@ -275,4 +343,8 @@
|
||||
|
||||
- [[SCHEMA]] — Wiki 结构规范
|
||||
- [[log]] — 变更日志
|
||||
- [[README]] — Wiki 说明
|
||||
- [[README]] — Wiki 说明
|
||||
|
||||
## Reviews
|
||||
|
||||
- [[ctm-review-20260515]] — CTM 论文集成 Review (2026-05-15)
|
||||
Reference in New Issue
Block a user