278 lines
9.3 KiB
Markdown
278 lines
9.3 KiB
Markdown
# LLM Wiki
|
||
|
||
> 知识索引页面 — 自动生成
|
||
> 最后更新:2026-05-14 | 总页面数:300
|
||
|
||
## Concepts
|
||
|
||
- [[additive-combinatorics]]
|
||
- [[agent-communication-stack]]
|
||
- [[agent-mediated-deception]]
|
||
- [[agent-network-memory-scope]]
|
||
- [[agent-network-taxonomy]]
|
||
- [[agent-network-topology]]
|
||
- [[agent-network-update-behavior]]
|
||
- [[agentic-systems]]
|
||
- [[ai-agent-security]]
|
||
- [[ai-alignment]]
|
||
- [[ai-mathematics]]
|
||
- [[ai-safety]]
|
||
- [[api-key-authentication]]
|
||
- [[attention-entropy-collapse]]
|
||
- [[attention-sinks]]
|
||
- [[automated-theorem-proving]]
|
||
- [[backtranslation-round-trip-relay]] — 回译接力:通过可逆编辑链评估 LLM 文档编辑保真度
|
||
- [[bidirectional-trajectory-evaluation]]
|
||
- [[bpf-syscall-interception]]
|
||
- [[cache-health-observability]]
|
||
- [[cache-hit-ratio]]
|
||
- [[cache-invalidation]]
|
||
- [[cache-safe-forking]]
|
||
- [[caddy-web-server]]
|
||
- [[cel-shading-style]]
|
||
- [[centralized-agent-architecture]]
|
||
- [[certainty-based-rewards]]
|
||
- [[chain-of-thought]]
|
||
- [[chaitin-algorithmic-information-theory]]
|
||
- [[chaitin-constant]]
|
||
- [[cl-bench-life]]
|
||
- [[classifier-free-guidance-language]] — CFG 在语言扩散模型中的应用
|
||
- [[clawless]]
|
||
- [[coarse-grained-counting]]
|
||
- [[cognitive-architecture]]
|
||
- [[completeness-logic]]
|
||
- [[compressed-sparse-attention]]
|
||
- [[computability-theory]]
|
||
- [[computerized-adaptive-testing]]
|
||
- [[confidence-correctness-alignment]]
|
||
- [[consistency-logic]]
|
||
- [[context-blue-clique]]
|
||
- [[context-compression]]
|
||
- [[context-learning]]
|
||
- [[context-misuse]]
|
||
- [[continuous-diffusion-language-models]] — 连续嵌入空间中的扩散语言模型
|
||
- [[continuum-hypothesis]]
|
||
- [[cramer-rao-lower-bound]]
|
||
- [[crawl4ai]]
|
||
- [[critical-failures]] — 关键失败:稀疏但严重的错误解释了约80%的文档退化
|
||
- [[curvine-distributed-cache]]
|
||
- [[darwin-godel-machine]]
|
||
- [[decentralized-agent-architecture]]
|
||
- [[deepseek-v4-flash]]
|
||
- [[deepseek-vit]]
|
||
- [[delegate-52]] — Microsoft 基准:310工作环境 × 52专业领域,评估LLM委托工作就绪性
|
||
- [[delegated-work]] — 委托工作:新兴LLM交互范式,用户监督模型代其完成任务
|
||
- [[depth-scaling-signal-degradation]]
|
||
- [[diagonal-ramsey-number]]
|
||
- [[diagonalization-method]]
|
||
- [[discrete-diffusion-language-models]] — 离散 token 空间中的扩散语言模型
|
||
- [[distractor-context]] — 干扰上下文:话题相关但无需编辑的文档,模拟不完美检索精度
|
||
- [[document-degradation]] — 文档退化:LLM在长委托工作流中静默破坏文档内容的现象
|
||
- [[domain-knowledge-reasoning]]
|
||
- [[domain-specific-evaluation]] — 领域特定评估:每个领域自定义解析器和语义等价评分的评估方法
|
||
- [[duo-attention]]
|
||
- [[dynamic-mode-decomposition]]
|
||
- [[embedded-language-flows]] — ELF: 连续嵌入流匹配语言模型
|
||
- [[eml-operator]]
|
||
- [[empirical-discovery-simulation]]
|
||
- [[ensemble-based-rewards]]
|
||
- [[evolutionary-algorithms]]
|
||
- [[exponential-decay-reward]]
|
||
- [[few-shot-learning]]
|
||
- [[fine-grained-counting]]
|
||
- [[flash-attention]]
|
||
- [[flash-attention-3]]
|
||
- [[flow-matching]] — 连续时间流匹配生成框架
|
||
- [[formal-security-model]]
|
||
- [[formal-systems]]
|
||
- [[formal-verification]]
|
||
- [[forward-authentication]]
|
||
- [[fourier-filter-dynamics]]
|
||
- [[fp4-quantization-training]]
|
||
- [[furstenberg-correspondence]]
|
||
- [[generation-verification-asymmetry]]
|
||
- [[generative-perplexity]] — 基于第三方模型评估生成质量的指标
|
||
- [[genetic-programming]]
|
||
- [[geometric-ramsey-theory]]
|
||
- [[glitch-art-style]]
|
||
- [[godel-incompleteness-theorems]]
|
||
- [[godel-numbering]]
|
||
- [[goodsteins-theorem]]
|
||
- [[gpt-image2]]
|
||
- [[gravitino-unified-metadata]]
|
||
- [[greedy-context-screening]]
|
||
- [[green-tao-theorem]]
|
||
- [[group-relative-policy-optimization]]
|
||
- [[grouped-query-attention]]
|
||
- [[halftone-print-style]]
|
||
- [[halting-problem]]
|
||
- [[heavily-compressed-attention]]
|
||
- [[hilberts-program]]
|
||
- [[human-agent-trust]]
|
||
- [[human-centered-ai]]
|
||
- [[hybrid-attention-architecture]]
|
||
- [[hyperagents]]
|
||
- [[hypergraph-ramsey-number]]
|
||
- [[identity-reference-resolution]]
|
||
- [[image-generation-prompt-design]]
|
||
- [[intrinsic-rewards-sharpening]]
|
||
- [[jagged-frontier]] — 锯齿前沿:AI模型能力在不同领域间不均衡、不可预测的分布
|
||
- [[klein-blue]]
|
||
- [[knowledge-bank]]
|
||
- [[kolmogorov-complexity]]
|
||
- [[koopman-autoencoder]]
|
||
- [[koopman-predictor]]
|
||
- [[koopman-theory]]
|
||
- [[kv-cache-bottleneck]]
|
||
- [[kvcache-transfer]]
|
||
- [[length-extrapolation]] — 长度外推:让 LLM 处理超出预训练窗口的序列长度
|
||
- [[linear-attention-methods]]
|
||
- [[llm-applications]]
|
||
- [[llm-evaluation-benchmarks]]
|
||
- [[long-context-understanding]]
|
||
- [[long-horizon-evaluation]] — 长视界评估:通过延长交互揭示短评估中不可见的退化模式
|
||
- [[lost-in-the-middle]]
|
||
- [[lovasz-local-lemma]]
|
||
- [[lucas-penrose-argument]]
|
||
- [[mamba-ssm]]
|
||
- [[manifold-constrained-hyper-connections]]
|
||
- [[mathematical-pluralism]]
|
||
- [[maze-navigation]]
|
||
- [[memory-caching-rnn]]
|
||
- [[messy-context-reasoning]]
|
||
- [[meta-jctrader]]
|
||
- [[meta-learning]]
|
||
- [[metacognitive-self-modification]]
|
||
- [[metamathematics]]
|
||
- [[million-token-context]]
|
||
- [[mixture-of-attention-schemes]]
|
||
- [[mixture-of-depths-attention]]
|
||
- [[mixture-of-experts]]
|
||
- [[model-collapse-step]]
|
||
- [[multi-head-attention]]
|
||
- [[multi-head-latent-attention]]
|
||
- [[multi-query-attention]]
|
||
- [[multi-token-prediction]]
|
||
- [[multimodal-large-language-model]]
|
||
- [[muon-optimizer]]
|
||
- [[native-sparse-attention]]
|
||
- [[neuroscience]]
|
||
- [[non-stationary-time-series]]
|
||
- [[ntk-aware-interpolation]]
|
||
- [[on-policy-distillation]]
|
||
- [[paley-graph]]
|
||
- [[paris-harrington-theorem]]
|
||
- [[path-tracing]]
|
||
- [[peano-arithmetic]]
|
||
- [[perception-gap]]
|
||
- [[prefill-as-a-service]]
|
||
- [[prefill-decode-disaggregation]]
|
||
- [[prefix-matching]]
|
||
- [[primitive-recursive-functions]]
|
||
- [[probabilistic-method]]
|
||
- [[procedural-task-execution]]
|
||
- [[program-synthesis]]
|
||
- [[prompt-caching]]
|
||
- [[prompt-layering]]
|
||
- [[prompt-reverse-engineering]]
|
||
- [[rag-systems]]
|
||
- [[ramsey-context-cache]]
|
||
- [[ramsey-context-graph]]
|
||
- [[ramsey-context-template]]
|
||
- [[ramsey-numbers]]
|
||
- [[ramsey-theory]]
|
||
- [[ramsey-theory-applications]]
|
||
- [[random-graph-theory]]
|
||
- [[real-life-context-learning]]
|
||
- [[rectified-flows]] — Flow Matching 中的直线插值路径
|
||
- [[recursive-self-improvement]]
|
||
- [[reference-gap]]
|
||
- [[reinforcement-learning-trading]]
|
||
- [[reverse-proxy-authentication]]
|
||
- [[reward-hacking-llm]]
|
||
- [[reward-model]]
|
||
- [[risograph-print-style]]
|
||
- [[rlvr-unified-framework]]
|
||
- [[rolling-kv-cache]] — 滚动 KV 缓存:StreamingLLM 的两段式固定大小缓存机制
|
||
- [[rotary-position-embedding]]
|
||
- [[round-trip-reconstruction-score]] — RS@k:衡量k次交互后文档重建质量的评估指标
|
||
- [[rule-system-application]]
|
||
- [[russells-paradox]]
|
||
- [[russian-constructivism]]
|
||
- [[sde-sampler-language]] — 语言扩散中的随机微分方程采样器
|
||
- [[secure-containers]]
|
||
- [[seer-attention]]
|
||
- [[self-conditioning]] — 用自身中间预测作为条件的扩散技术
|
||
- [[self-improving-ai]]
|
||
- [[self-reference]]
|
||
- [[self-verification-rewards]]
|
||
- [[semantic-equivalence]] — 语义等价:通过领域特定解析器衡量文档间语义等价程度的方法
|
||
- [[shared-weight-discretization]] — ELF 的共享权重去噪-解码机制
|
||
- [[singularity]]
|
||
- [[sink-token]] — 可学习汇 Token:预训练时添加专用 Token 作为唯一注意力汇
|
||
- [[softmax-off-by-one]] — SoftMax₁:允许丢弃多余注意力的 SoftMax 变体
|
||
- [[sparse-attention-patterns]]
|
||
- [[specialist-training-pipeline]]
|
||
- [[specialized-rl]]
|
||
- [[specialized-sft]]
|
||
- [[spurious-predictability]]
|
||
- [[stub-pattern]]
|
||
- [[subquadratic-transformer-alternatives]]
|
||
- [[symbolic-regression]]
|
||
- [[system-2-thinking]]
|
||
- [[system-message-abuse]]
|
||
- [[szemerédi-regularity-lemma]]
|
||
- [[test-time-scaling]]
|
||
- [[test-time-training-rl]]
|
||
- [[time-variant-dynamics]]
|
||
- [[token-efficiency]]
|
||
- [[tool-registry]]
|
||
- [[transfer-learning]]
|
||
- [[unified-rft]]
|
||
- [[unsupervised-rlvr]]
|
||
- [[userspace-kernel]]
|
||
- [[van-der-waerden-theorem]]
|
||
- [[visual-primitives]]
|
||
- [[window-attention]] — 窗口注意力:仅缓存最近 Token 的朴素方案,因驱逐注意力汇而崩溃
|
||
- [[worst-case-threat-model]]
|
||
- [[x-prediction-parameterization]] — Flow Matching 中直接预测干净数据的参数化
|
||
|
||
## Papers
|
||
|
||
- [[behrouz-memory-caching-rnn]]
|
||
- [[clawless-ai-agent-security]]
|
||
- [[deepseek-v4-million-token-context]]
|
||
- [[dou-cl-bench]]
|
||
- [[elf-embedded-language-flows]] — ELF: 连续嵌入空间中的 Flow Matching 语言扩散模型 (2026)
|
||
- [[godel-incompleteness-tutorial]]
|
||
- [[he-urlvr-sharpening-2026]]
|
||
- [[hunyuan-team-cl-bench-life]]
|
||
- [[laban-llms-corrupt-documents-delegate]] — "LLMs Corrupt Your Documents When You Delegate" — DELEGATE-52
|
||
- [[li-amd-human-perception]]
|
||
- [[liu-koopa-2023]]
|
||
- [[llm-attention-survey-2026]]
|
||
- [[nikolopoulos-spurious-predictability]]
|
||
- [[odrzywolek-eml-single-operator]]
|
||
- [[qin-prfaas-cross-datacenter]]
|
||
- [[ramsey-numbers-survey]]
|
||
- [[song-agent-network-taxonomy]]
|
||
- [[streaming-llm]] — StreamingLLM: 基于注意力汇的无限长流式语言模型推理框架 (ICLR 2024)
|
||
- [[tao-klowden-ai-mathematical-methods]]
|
||
- [[thinking-with-visual-primitives]]
|
||
- [[zhang-hyperagents]]
|
||
- [[zhu-moda-mixture-of-depths]]
|
||
|
||
## Articles
|
||
|
||
|
||
- [[caddy-reverse-proxy-auth]]
|
||
- [[crawl4ai-open-source-web-crawler]]
|
||
- [[gpt-image2-prompt-collection]]
|
||
- [[oppo-multimodal-data-lake]]
|
||
- [[prompt-caching-architecture]]
|
||
- [[ramsey-context-construction]]
|
||
## Special Pages
|
||
|
||
- [[SCHEMA]] — Wiki 结构规范
|
||
- [[log]] — 变更日志
|
||
- [[README]] — Wiki 说明 |