This commit is contained in:
2026-06-01 10:46:01 +08:00
parent 2faf4bb002
commit e96b955fda
221 changed files with 10219 additions and 332 deletions

867
index.md
View File

@@ -1,350 +1,553 @@
# LLM Wiki
> 知识索引页面 — 自动生成
> 最后更新2026-05-15 | 总页面数:335
> 最后更新2026-05-31 | 总页面数:528
## Concepts
- [[adaptive-computation-time]] — 根据输入难度动态调整计算量的技术族ACT, PonderNet 等)
- [[additive-combinatorics]]
- [[agent-communication-stack]]
- [[agent-mediated-deception]]
- [[agent-network-memory-scope]]
- [[agent-network-taxonomy]]
- [[agent-network-topology]]
- [[agent-network-update-behavior]]
- [[agentic-systems]]
- [[ai-agent-security]]
- [[ai-alignment]]
- [[ai-mathematics]]
- [[ai-safety]]
- [[analytical-report-synthesizer]] — LLM 驱动的预测结果→分析报告自动生成器
- [[api-key-authentication]]
- [[attention-entropy-collapse]]
- [[attention-sinks]]
- [[automated-theorem-proving]]
- [[backtranslation-round-trip-relay]] — 回译接力:通过可逆编辑链评估 LLM 文档编辑保真度
- [[base-table-embedding]] — DIME 第一阶段:双路径编码捕获表内语义
- [[behrouz-memory-caching-rnn]]
- [[bidirectional-trajectory-evaluation]]
- [[bpf-syscall-interception]]
- [[cache-health-observability]]
- [[cache-hit-ratio]]
- [[cache-invalidation]]
- [[cache-safe-forking]]
- [[caddy-reverse-proxy-auth]]
- [[caddy-web-server]]
- [[cel-shading-style]]
- [[centralized-agent-architecture]]
- [[certainty-based-loss]] — 通过 argmin(loss) + argmax(certainty) 双 tick 选择实现原生自适应计算
- [[certainty-based-rewards]]
- [[chain-of-thought]]
- [[chaitin-algorithmic-information-theory]]
- [[chaitin-constant]]
- [[cl-bench-life]]
- [[classifier-free-guidance-language]] — CFG 在语言扩散模型中的应用
- [[clawless]]
- [[clawless-ai-agent-security]]
- [[coarse-grained-counting]]
- [[cognitive-architecture]]
- [[completeness-logic]]
- [[composable-base-model-architecture]] — 基础模型池 + 共享组件的可组合建模框架
- [[compressed-sparse-attention]]
- [[computability-theory]]
- [[computerized-adaptive-testing]]
- [[conditional-model-dispatcher]] — ZCP + 历史 EMA 驱动的模型选择与条件增强调度器
- [[confidence-correctness-alignment]]
- [[consistency-logic]]
- [[context-blue-clique]]
- [[context-compression]]
- [[context-learning]]
- [[context-misuse]]
- [[continuous-diffusion-language-models]] — 连续嵌入空间中的扩散语言模型
- [[continuous-thought-machine]] — CTM以神经时序动态和同步为核心计算原理的新架构
- [[continuum-hypothesis]]
- [[cramer-rao-lower-bound]]
- [[crawl4ai]]
- [[crawl4ai-open-source-web-crawler]]
- [[critical-failures]] — 关键失败稀疏但严重的错误解释了约80%的文档退化
- [[curvine-distributed-cache]]
- [[darlow-ctm-2025]] — CTM: 以神经同步为表示的持续思考机器 (NeurIPS 2025)
- [[darwin-godel-machine]]
- [[data-slice]] — 任务特定的关系数据库子集DIME 的核心数据对象
- [[decentralized-agent-architecture]]
- [[deepseek-v4-flash]]
- [[deepseek-v4-million-token-context]]
- [[deepseek-vit]]
- [[delegate-52]] — Microsoft 基准310工作环境 × 52专业领域评估LLM委托工作就绪性
- [[delegated-work]] — 委托工作新兴LLM交互范式用户监督模型代其完成任务
- [[depth-scaling-signal-degradation]]
- [[diagonal-ramsey-number]]
- [[diagonalization-method]]
- [[dime-dynamic-in-database-modeling-engine]] — DIMENeurIDA 的核心动态建模引擎
- [[discrete-diffusion-language-models]] — 离散 token 空间中的扩散语言模型
- [[distractor-context]] — 干扰上下文:话题相关但无需编辑的文档,模拟不完美检索精度
- [[document-degradation]] — 文档退化LLM在长委托工作流中静默破坏文档内容的现象
- [[domain-knowledge-reasoning]]
- [[domain-specific-evaluation]] — 领域特定评估:每个领域自定义解析器和语义等价评分的评估方法
- [[dou-cl-bench]]
- [[duo-attention]]
- [[dynamic-in-database-modeling]] — 从共享组件在查询时装配定制模型的新范式
- [[dynamic-mode-decomposition]]
- [[dynamic-model-fusion]] — 上下文感知的选择性关系融合模块
- [[dynamic-relation-modeling]] — 跨表关系结构感知的消息传递
- [[elf-embedded-language-flows]] — ELF: 连续嵌入空间中的 Flow Matching 语言扩散模型 (2026)
- [[embedded-language-flows]] — ELF: 连续嵌入流匹配语言模型
- [[eml-operator]]
- [[empirical-discovery-simulation]]
- [[ensemble-based-rewards]]
- [[evolutionary-algorithms]]
- [[exponential-decay-reward]]
- [[few-shot-learning]]
- [[fine-grained-counting]]
- [[flash-attention]]
- [[flash-attention-3]]
- [[flow-matching]] — 连续时间流匹配生成框架
- [[formal-security-model]]
- [[formal-systems]]
- [[formal-verification]]
- [[forward-authentication]]
- [[fourier-filter-dynamics]]
- [[fp4-quantization-training]]
- [[furstenberg-correspondence]]
- [[generation-verification-asymmetry]]
- [[generative-perplexity]] — 基于第三方模型评估生成质量的指标
- [[genetic-programming]]
- [[geometric-ramsey-theory]]
- [[glitch-art-style]]
- [[godel-incompleteness-theorems]]
- [[godel-incompleteness-tutorial]]
- [[godel-numbering]]
- [[goodsteins-theorem]]
- [[gpt-image2]]
- [[gpt-image2-prompt-collection]]
- [[gravitino-unified-metadata]]
- [[greedy-context-screening]]
- [[green-tao-theorem]]
- [[group-relative-policy-optimization]]
- [[grouped-query-attention]]
- [[halftone-print-style]]
- [[halting-problem]]
- [[he-urlvr-sharpening-2026]]
- [[heavily-compressed-attention]]
- [[hilberts-program]]
- [[human-agent-trust]]
- [[human-centered-ai]]
- [[hunyuan-team-cl-bench-life]]
- [[hybrid-attention-architecture]]
- [[hyperagents]]
- [[hypergraph-ramsey-number]]
- [[identity-reference-resolution]]
- [[image-generation-prompt-design]]
- [[in-database-analytics]] — 在 DBMS 内部直接执行 ML/分析任务的方法论
- [[internal-ticks]] — 与数据维度解耦的内部时序CTM 的「思考步骤」展开维度
- [[internal-world-model]] — agent 内部构建的环境表征,在 CTM 迷宫任务中涌现
- [[intrinsic-rewards-sharpening]]
- [[jagged-frontier]] — 锯齿前沿AI模型能力在不同领域间不均衡、不可预测的分布
- [[klein-blue]]
- [[knowledge-bank]]
- [[kolmogorov-complexity]]
- [[koopman-autoencoder]]
- [[koopman-predictor]]
- [[koopman-theory]]
- [[kv-cache-bottleneck]]
- [[kvcache-transfer]]
- [[laban-llms-corrupt-documents-delegate]] — "LLMs Corrupt Your Documents When You Delegate" — DELEGATE-52
- [[length-extrapolation]] — 长度外推:让 LLM 处理超出预训练窗口的序列长度
- [[li-amd-human-perception]]
- [[linear-attention-methods]]
- [[liu-koopa-2023]]
- [[llm-applications]]
- [[llm-attention-survey-2026]]
- [[llm-evaluation-benchmarks]]
- [[log]] — 变更日志
- [[long-context-understanding]]
- [[long-horizon-evaluation]] — 长视界评估:通过延长交互揭示短评估中不可见的退化模式
- [[lost-in-the-middle]]
- [[lovasz-local-lemma]]
- [[lucas-penrose-argument]]
- [[mamba-ssm]]
- [[manifold-constrained-hyper-connections]]
- [[mathematical-pluralism]]
- [[maze-navigation]]
- [[memory-caching-rnn]]
- [[messy-context-reasoning]]
- [[meta-jctrader]]
- [[meta-learning]]
- [[metacognitive-self-modification]]
- [[metamathematics]]
- [[million-token-context]]
- [[mixture-of-attention-schemes]]
- [[mixture-of-depths-attention]]
- [[mixture-of-experts]]
- [[model-collapse-step]]
- [[multi-head-attention]]
- [[multi-head-latent-attention]]
- [[multi-query-attention]]
- [[multi-token-prediction]]
- [[multimodal-large-language-model]]
- [[muon-optimizer]]
- [[native-sparse-attention]]
- [[neural-synchronization]] — 将神经元激活历史的时序相关性直接用作潜在表示
- [[neurida]] — Neural In-Database Analytics自主端到端库内分析系统
- [[neuron-level-models]] — 每个神经元拥有私有参数的 MLP替代统一激活函数
- [[neuron-pairing]] — 对 O(D²) 同步矩阵的子采样策略,用于效率与表达力平衡
- [[neuroscience]]
- [[nikolopoulos-spurious-predictability]]
- [[non-stationary-time-series]]
- [[ntk-aware-interpolation]]
- [[odrzywolek-eml-single-operator]]
- [[on-policy-distillation]]
- [[oppo-multimodal-data-lake]]
- [[paley-graph]]
- [[paris-harrington-theorem]]
- [[path-tracing]]
- [[peano-arithmetic]]
- [[perception-gap]]
- [[pre-activation-history]] — 每个神经元维护的滚动前激活缓冲区NLM 的输入
- [[prefill-as-a-service]]
- [[prefill-decode-disaggregation]]
- [[prefix-matching]]
- [[primitive-recursive-functions]]
- [[probabilistic-method]]
- [[procedural-task-execution]]
- [[program-synthesis]]
- [[prompt-caching]]
- [[prompt-caching-architecture]]
- [[prompt-layering]]
- [[prompt-reverse-engineering]]
- [[qin-prfaas-cross-datacenter]]
- [[query-intent-analyzer]] — LLM 驱动的 NLQ 解析器,输出结构化任务/数据画像
- [[rag-systems]]
- [[ramsey-context-cache]]
- [[ramsey-context-construction]]
- [[ramsey-context-graph]]
- [[ramsey-context-template]]
- [[ramsey-numbers]]
- [[ramsey-numbers-survey]]
- [[ramsey-theory]]
- [[ramsey-theory-applications]]
- [[random-graph-theory]]
- [[README]] — Wiki 说明
- [[real-life-context-learning]]
- [[rectified-flows]] — Flow Matching 中的直线插值路径
- [[recursive-self-improvement]]
- [[reference-gap]]
- [[reinforcement-learning-trading]]
- [[relational-graph]] — 以 FK-PK 为边的元组图,关系建模的数据结构基础
- [[reverse-proxy-authentication]]
- [[reward-hacking-llm]]
- [[reward-model]]
- [[risograph-print-style]]
- [[rlvr-unified-framework]]
- [[rolling-kv-cache]] — 滚动 KV 缓存StreamingLLM 的两段式固定大小缓存机制
- [[rotary-position-embedding]]
- [[round-trip-reconstruction-score]] — RS@k衡量k次交互后文档重建质量的评估指标
- [[rule-system-application]]
- [[russells-paradox]]
- [[russian-constructivism]]
- [[SCHEMA]] — Wiki 结构规范
- [[sde-sampler-language]] — 语言扩散中的随机微分方程采样器
- [[secure-containers]]
- [[seer-attention]]
- [[self-conditioning]] — 用自身中间预测作为条件的扩散技术
- [[self-improving-ai]]
- [[self-reference]]
- [[self-verification-rewards]]
- [[semantic-equivalence]] — 语义等价:通过领域特定解析器衡量文档间语义等价程度的方法
- [[shared-weight-discretization]] — ELF 的共享权重去噪-解码机制
- [[singularity]]
- [[sink-token]] — 可学习汇 Token预训练时添加专用 Token 作为唯一注意力汇
- [[softmax-off-by-one]] — SoftMax₁允许丢弃多余注意力的 SoftMax 变体
- [[song-agent-network-taxonomy]]
- [[sparse-attention-patterns]]
- [[specialist-training-pipeline]]
- [[specialized-rl]]
- [[specialized-sft]]
- [[spiking-neural-networks]] — 使用离散脉冲和事件驱动计算的生物启发神经网络
- [[spurious-predictability]]
- [[streaming-llm]] — StreamingLLM: 基于注意力汇的无限长流式语言模型推理框架 (ICLR 2024)
- [[stub-pattern]]
- [[subquadratic-transformer-alternatives]]
- [[symbolic-regression]]
- [[synapse-model]] — CTM 的 U-Net 风格循环突触结构,神经元间信息共享引擎
- [[system-2-thinking]]
- [[system-message-abuse]]
- [[szemerédi-regularity-lemma]]
- [[tabular-foundation-models]] — 大规模表格数据预训练的基础模型TabPFN, TabICL 等
- [[tao-klowden-ai-mathematical-methods]]
- [[temporal-decay-neural]] — 每对神经元可学习的指数衰减参数,控制同步的时间尺度
- [[test-time-scaling]]
- [[test-time-training-rl]]
- [[thinking-with-visual-primitives]]
- [[time-variant-dynamics]]
- [[token-efficiency]]
- [[tool-registry]]
- [[transfer-learning]]
- [[unified-rft]]
- [[unsupervised-rlvr]]
- [[userspace-kernel]]
- [[van-der-waerden-theorem]]
- [[visual-primitives]]
- [[window-attention]] — 窗口注意力:仅缓存最近 Token 的朴素方案,因驱逐注意力汇而崩溃
- [[worst-case-threat-model]]
- [[x-prediction-parameterization]] — Flow Matching 中直接预测干净数据的参数化
- [[xing-trails-2024]] — Trails: 数据库原生的深度神经网络模型选择 (VLDB 2024)
- [[zeng-dynamic-model-slicing-2024]] — 数据库内的动态模型切片技术 (VLDB 2024)
- [[zeng-neurida-2025]] — NeurIDA: 动态库内建模实现有效的关系数据库分析
- [[zero-cost-proxies]] — 无需完整训练即可估计模型性能的 NAS 技术
- [[zhang-hyperagents]]
- [[zhao-neurdb-2025]] — NeurDB: AI 驱动的自主数据库 (CIDR 2025)
- [[zhu-moda-mixture-of-depths]]
- [[action-applicability]] — Action Applicability (动作合法性判定)
- [[active-cache-warmup]] — Active Cache Warm-up (主动缓存预热)
- [[adaptive-computation-time]] — Adaptive Computation Time (ACT)
- [[adaptive-harness-simplification]] — Adaptive Harness Simplification自适应 Harness 简化)
- [[additive-combinatorics]] — Additive Combinatorics加法组合学
- [[agent-capability-stability-gap]] — Agent Capability-Stability Gap能力-稳定性差距)
- [[agent-communication-stack]] — Agent通信协议栈
- [[agent-completion-evaluation]] — Agent Completion EvaluationAgent 完成度评测)
- [[agent-computer-interface]] — Agent-Computer Interface (ACI)
- [[agent-eval-case-design]] — Agent Eval Case Design
- [[agent-eval-grader]] — Agent Eval Grader
- [[agent-eval-trace]] — Agent Eval Trace
- [[agent-evaluation-paradigm-shift]] — Agent 评测范式转变Paradigm Shift in Agent Evaluation
- [[agent-frameworks-to-platforms]] — Agent Frameworks to Platforms从 Agent 框架到 Agent 平台)
- [[agent-governance]] — Agent GovernanceAgent 治理与安全)
- [[agent-harness-engineering]] — Agent Harness EngineeringAgent 执行骨架工程)
- [[agent-harness-mini]] — Mini Agent Harness
- [[agent-mediated-deception]] — 代理中介欺骗 (Agent-Mediated Deception)
- [[agent-multidimensional-capability]] — Agent Multidimensional CapabilityAgent 多维能力)
- [[agent-network-memory-scope]] — Agent网络记忆范围
- [[agent-network-taxonomy]] — Agent网络三层分类法
- [[agent-network-topology]] — Agent网络拓扑
- [[agent-network-update-behavior]] — Agent网络更新行为
- [[agent-observability]] — Agent ObservabilityAgent 可观测性)
- [[agent-process-evaluation]] — Agent Process Evaluation过程评测
- [[agent-robustness-evaluation]] — Agent Robustness EvaluationAgent 鲁棒性评测)
- [[agent-safety-evaluation]] — Agent Safety EvaluationAgent 安全评测)
- [[agent-sandbox]] — Agent SandboxAgent 沙箱)
- [[agent-symbolic-learning]] — Agent Symbolic Learning (Agent 符号学习)
- [[agent-verification]] — Agent VerificationAgent 验证与评估)
- [[agentic-systems]] — Agentic Systems智能体系统
- [[ai-agent-security]] — AI代理安全
- [[ai-alignment]] — AI Alignment (AI对齐)
- [[ai-mathematics]] — AI and Mathematics (AI 与数学)
- [[ai-safety]] — AI Safety (AI安全)
- [[amortized-variational-inference]] — Amortized Variational Inference摊销变分推断
- [[analytical-report-synthesizer]] — Analytical Report Synthesizer
- [[anthropic-agent-evals]] — Anthropic Agent Evals
- [[api-key-authentication]] — API Key 认证 (API Key Authentication)
- [[asynchronous-rl-llm]] — 异步强化学习与大语言模型后训练
- [[attention-entropy-collapse]] — 注意力熵崩溃 (Attention Entropy Collapse)
- [[attention-sinks]] — 注意力汇 (Attention Sinks)
- [[autoharness]] — AutoHarness
- [[automated-theorem-proving]] — 自动定理证明 (Automated Theorem Proving, ATP)
- [[backtranslation-round-trip-relay]] — Backtranslation Round-Trip Relay
- [[base-table-embedding]] — Base Table Embedding
- [[bayesian-attention-geometry]] — Bayesian Attention Geometry (贝叶斯注意力几何)
- [[bayesian-attention-trilogy]] — Bayesian Attention Trilogy
- [[bayesian-wind-tunnels]] — Bayesian Wind Tunnels
- [[belief-accumulation]] — Belief Accumulation (信念累积)
- [[belief-transport]] — Belief Transport (信念传输)
- [[bidirectional-trajectory-evaluation]] — 双向轨迹评估 (Bidirectional Trajectory Evaluation)
- [[binding-constraint-thesis]] — Binding-Constraint Thesis约束瓶颈论
- [[bpf-syscall-interception]] — BPF系统调用拦截
- [[bypass-network-handle-distribution]] — Bypass Network Handle Distribution (旁路网络句柄分发)
- [[cache-cold-start]] — Cache Cold-Start (缓存冷启动)
- [[cache-health-observability]] — Cache Health Observability缓存健康度可观测性
- [[cache-hit-ratio]] — Cache Hit Ratio (CHR)
- [[cache-invalidation]] — Cache Invalidation缓存失效
- [[cache-safe-forking]] — Cache-Safe Forking缓存安全分叉
- [[caddy-web-server]] — Caddy Web Server
- [[capability-control-tradeoff]] — Capability-Control Tradeoff能力-控制权衡)
- [[capability-degradation]] — 能力退化 (Capability Degradation)
- [[cel-shading-style]] — 赛璐璐风格 (Cel-Shading)
- [[centralized-agent-architecture]] — 集中式Agent架构
- [[certainty-based-loss]] — Certainty-Based Loss
- [[certainty-based-rewards]] — 确定性奖励 (Certainty-Based Rewards)
- [[chain-of-thought]] — 思维链 (Chain-of-Thought, CoT)
- [[chaitin-algorithmic-information-theory]] — 算法信息论 (Algorithmic Information Theory, AIT)
- [[chaitin-constant]] — 蔡廷常数 Ω (Chaitin's Constant)
- [[cl-bench-life]] — CL-Bench Life
- [[classifier-free-guidance-language]] — Classifier-Free Guidance for Language
- [[clawless]] — ClawLess
- [[coarse-grained-counting]] — 粗粒度计数 (Coarse-grained Counting)
- [[coarse-to-fine-granularity]] — Coarse-to-Fine Granularity
- [[code-as-harness]] — Code as Harness
- [[cognitive-architecture]] — Cognitive Architecture (认知架构)
- [[compiled-ai-paradigm]] — Compiled AI Paradigm (编译型 AI 范式)
- [[completeness-logic]] — 完备性 (Completeness, 逻辑学)
- [[composable-base-model-architecture]] — Composable Base Model Architecture
- [[compressed-sparse-attention]] — Compressed Sparse Attention (CSA)
- [[computability-theory]] — 可计算性理论 (Computability Theory)
- [[computer-use-agents]] — Computer Use Agents (CUAs)
- [[computerized-adaptive-testing]] — Computerized Adaptive Testing (CAT)
- [[conditional-model-dispatcher]] — Conditional Model Dispatcher
- [[confidence-correctness-alignment]] — 置信度-正确性对齐 (Confidence-Correctness Alignment)
- [[consistency-logic]] — 一致性 (Consistency, 逻辑学)
- [[context-blue-clique]] — Context Blue Clique上下文蓝色团
- [[context-compression]] — Context Compression上下文压缩
- [[context-drift]] — Context Drift上下文漂移
- [[context-engineering]] — Context Engineering上下文工程
- [[context-learning]] — 上下文学习 (Context Learning)
- [[context-management]] — Context Management上下文管理
- [[context-misuse]] — 上下文误用 (Context Misuse)
- [[context-pruning]] — Context Pruning (上下文剪枝)
- [[context-state-estimation]] — Context as State Estimation上下文作为状态估计
- [[continuous-diffusion-language-models]] — Continuous Diffusion Language Models
- [[continuous-thought-machine]] — Continuous Thought Machine (CTM)
- [[continuum-hypothesis]] — 连续统假设 (Continuum Hypothesis, CH)
- [[controlled-autonomy]] — Controlled Autonomy (受控的自主性)
- [[cost-quality-speed-trilemma]] — Cost-Quality-Speed Trilemma成本-质量-速度三元悖论)
- [[covariance-matrix]] — 协方差矩阵 (Covariance Matrix)
- [[covariance-matrix-knowledge]] — 协方差矩阵知识存储 (Covariance Matrix Knowledge Storage)
- [[cramer-rao-lower-bound]] — Cramér-Rao Lower Bound (CRLB)
- [[crawl4ai]] — Crawl4AI
- [[critical-failures]] — Critical Failures / 关键失败
- [[curvine-distributed-cache]] — Curvine 云原生分布式缓存
- [[darwin-godel-machine]] — Darwin Gödel Machine (达尔文·哥德尔机)
- [[data-hierarchical-governance]] — Data Hierarchical Governance (L0-L4 数据分级治理)
- [[data-label-consistency]] — Data-Label Consistency (数据-标签一致性)
- [[data-quality-over-scale]] — Data Quality over Scale (数据质量重于规模)
- [[data-replay]] — 数据回放 (Data Replay)
- [[data-slice]] — Data Slice
- [[decentralized-agent-architecture]] — 去中心化Agent架构
- [[deep-and-wide-reasoning]] — Deep-and-Wide Reasoning深度且宽广的推理
- [[deep-thinking-sft]] — Deep-Thinking SFT (深思考SFT数据)
- [[deepseek-v4-flash]] — DeepSeek-V4-Flash
- [[deepseek-vit]] — DeepSeek-ViT
- [[delegate-52]] — DELEGATE-52
- [[delegated-work]] — Delegated Work / 委托工作
- [[depth-scaling-signal-degradation]] — LLM 深度扩展与信号退化
- [[dgae]] — Difficulty-Balanced Group Advantage Estimation (DGAE)
- [[dgpo]] — Difficulty-Aware Group Policy Optimization (DGPO)
- [[diagonal-ramsey-number]] — Diagonal Ramsey Number对角拉姆齐数
- [[diagonalization-method]] — 对角线方法 (Diagonalization Method)
- [[dime-dynamic-in-database-modeling-engine]] — DIME (Dynamic In-Database Modeling Engine)
- [[discrete-diffusion-language-models]] — discrete-diffusion-language-models
- [[distractor-context]] — Distractor Context / 干扰上下文
- [[distributed-cache-routing]] — Distributed Cache Routing (分布式缓存路由)
- [[distributed-optimistic-locking]] — Distributed Optimistic Locking (分布式乐观锁)
- [[distributed-prompt-caching]] — Distributed Prompt Caching (分布式提示词缓存)
- [[distribution-shift]] — Distribution Shift分布偏移
- [[document-degradation]] — Document Degradation / 文档退化
- [[domain-knowledge-reasoning]] — 领域知识推理 (Domain Knowledge Reasoning)
- [[domain-specific-evaluation]] — Domain-Specific Evaluation / 领域特定评估
- [[dominant-shuffle]] — Dominant Shuffle
- [[dqw]] — Difficulty-Aware Question-Level Weighting (DQW)
- [[dual-layer-rl]] — Dual-Layer RL (双层强化学习)
- [[dual-space-rl]] — Dual Space RL (DSRL)
- [[duo-attention]] — DuoAttention
- [[dynamic-in-database-modeling]] — Dynamic In-Database Modeling
- [[dynamic-mode-decomposition]] — Dynamic Mode Decomposition (DMD)
- [[dynamic-model-fusion]] — Dynamic Model Fusion
- [[dynamic-relation-modeling]] — Dynamic Relation Modeling
- [[embedded-language-flows]] — Embedded Language Flows (ELF)
- [[eml-operator]] — EML 算子 (Exp-Minus-Log)
- [[empirical-discovery-simulation]] — 经验发现与模拟 (Empirical Discovery & Simulation)
- [[endogenous-reasoning]] — Endogenous Reasoning内生推理
- [[ensemble-based-rewards]] — 集成奖励 (Ensemble-Based Rewards)
- [[etclovg-taxonomy]] — ETCLOVG 七层分类法
- [[evolutionary-algorithms]] — Evolutionary Algorithms (进化算法)
- [[evolving-knowledge-injection]] — 进化知识注入 (Evolving Knowledge Injection)
- [[execution-environment]] — Execution Environment执行环境与沙箱
- [[exponential-decay-reward]] — 指数衰减奖励 (Exponential Decay Reward)
- [[few-shot-learning]] — Few-Shot Learning (少样本学习)
- [[fine-grained-counting]] — 细粒度计数 (Fine-grained Counting)
- [[flash-attention]] — FlashAttention
- [[flash-attention-3]] — FlashAttention-3
- [[flow-matching]] — Flow Matching
- [[forecasting-augmentation-taxonomy]] — Forecasting Augmentation Taxonomy
- [[formal-security-model]] — 形式化安全模型
- [[formal-systems]] — 形式系统 (Formal System)
- [[formal-verification]] — Formal Verification (形式化验证)
- [[forward-authentication]] — 外部认证委托 (Forward Authentication)
- [[fourier-filter-dynamics]] — Fourier Filter for DynamicsFourier Filter 动力学分解)
- [[fp4-quantization-training]] — FP4 Quantization-Aware Training
- [[freqmask-freqmix]] — FreqMask / FreqMix
- [[furstenberg-correspondence]] — Furstenberg Correspondence Principle
- [[generation-verification-asymmetry]] — 生成-验证不对称性 (Generation-Verification Asymmetry)
- [[generative-general-unification]] — Generative-General-Unification (GenAI 三支柱)
- [[generative-perplexity]] — generative-perplexity
- [[genetic-programming]] — Genetic Programming (遗传编程)
- [[geometric-ramsey-theory]] — Geometric Ramsey Theory几何拉姆齐理论
- [[gflownet-fine-tuning]] — GFlowNet 微调
- [[glitch-art-style]] — 故障艺术 (Glitch Art)
- [[global-context-hash-tree]] — Global Context Hash Tree (全局上下文哈希树)
- [[godel-incompleteness-theorems]] — 哥德尔不完备定理 (Gödel's Incompleteness Theorems)
- [[godel-numbering]] — 哥德尔编码 (Gödel Numbering)
- [[goodsteins-theorem]] — 古德斯坦定理 (Goodstein's Theorem)
- [[governance-security]] — Governance & Security治理与安全
- [[gpt-image2]] — GPT-Image-2
- [[gradient-alignment]] — Gradient Alignment (PreRL)
- [[gram-generative-recursive-reasoning]] — GRAMGenerative Recursive reAsoning Models
- [[gravitino-unified-metadata]] — Gravitino 统一元数据管理
- [[greedy-context-screening]] — Greedy Context Screening贪心上下文筛选
- [[green-tao-theorem]] — Green-Tao Theorem
- [[group-relative-policy-optimization]] — 群体相对策略优化 (GRPO)
- [[grouped-query-attention]] — Grouped-Query Attention (GQA)
- [[grpo]] — Group Relative Policy Optimization (GRPO)
- [[gui-tool-hybrid-action-space]] — GUI-Tool Hybrid Action Space
- [[halftone-print-style]] — 半调印刷风格 (Halftone Print Style)
- [[halting-problem]] — 停机问题 (Halting Problem)
- [[hardening-execution-environments]] — Hardening Execution Environments硬化执行环境
- [[harness-as-action-verifier]] — Harness-as-Action-Verifier
- [[harness-as-policy]] — Harness-as-Policy (Code as Policy)
- [[harness-coupling-problem]] — Harness Coupling ProblemHarness 耦合问题)
- [[harness-engineering]] — Harness Engineering
- [[hars]] — HARS调和适应保留评分
- [[heavily-compressed-attention]] — Heavily Compressed Attention (HCA)
- [[held-out-validation-gate]] — Held-Out Validation Gate (留出验证门)
- [[heuristic-learning]] — Heuristic Learning (启发式学习)
- [[hilberts-program]] — 希尔伯特计划 (Hilbert's Program)
- [[human-agent-trust]] — 人机信任 (Human-Agent Trust)
- [[human-centered-ai]] — Human-Centered AI (以人类为中心的 AI)
- [[hybrid-attention-architecture]] — Hybrid Attention Architecture
- [[hyperagents]] — Hyperagents (超智能体)
- [[hypergraph-ramsey-number]] — Hypergraph Ramsey Number超图拉姆齐数
- [[identity-reference-resolution]] — 身份指代消解 (Identity Reference Resolution)
- [[image-generation-prompt-design]] — 图像生成 Prompt 设计
- [[in-database-analytics]] — In-Database Analytics
- [[inference-primitives]] — Inference Primitives (推理原语)
- [[inference-time-scaling]] — Inference-Time Scaling推理时扩展
- [[input-superposition]] — Input Superposition
- [[interleaved-gui-tool-trajectory-scaling]] — Interleaved GUI-Tool Trajectory Scaling Pipeline
- [[internal-ticks]] — Internal Ticks
- [[internal-world-model]] — Internal World Model
- [[intrinsic-rewards-sharpening]] — 内在奖励锐化机制 (Intrinsic Rewards Sharpening)
- [[iterative-code-refinement]] — Iterative Code Refinement (迭代代码精炼)
- [[jagged-frontier]] — Jagged Frontier / 锯齿前沿
- [[klein-blue]] — 克莱因蓝 (Klein Blue / IKB)
- [[knowledge-adaptation]] — 知识适应 (Knowledge Adaptation)
- [[knowledge-agnostic-augmentation]] — 知识无关增强 (Knowledge-Agnostic Augmentation)
- [[knowledge-aware-augmentation]] — 知识感知增强 (Knowledge-Aware Augmentation)
- [[knowledge-bank]] — Knowledge Bank — AI 辅助开发时代的知识管理系统
- [[knowledge-internalization]] — 知识内化 (Knowledge Internalization)
- [[knowledge-retention]] — 知识保留 (Knowledge Retention)
- [[knowledge-tree]] — 知识树 (Knowledge Tree)
- [[kolmogorov-complexity]] — 柯尔莫哥洛夫复杂度 (Kolmogorov Complexity)
- [[koopman-autoencoder]] — Koopman Autoencoder (KAE)
- [[koopman-predictor]] — Koopman PredictorKoopman 预测器)
- [[koopman-theory]] — Koopman TheoryKoopman 理论)
- [[kore-augmentation]] — KORE-AUGMENTATION知识导向增强
- [[kore-constraint]] — KORE-CONSTRAINT知识导向约束
- [[kv-cache-bottleneck]] — KV 缓存内存瓶颈
- [[kvcache-transfer]] — KVCache 传输与优化
- [[language-gradient]] — Language Gradient (语言梯度)
- [[language-loss]] — Language Loss (语言损失)
- [[latent-variable-generative-model]] — Latent-Variable Generative Model潜在变量生成模型
- [[length-extrapolation]] — 长度外推 (Length Extrapolation)
- [[lifecycle-orchestration]] — Lifecycle & Orchestration生命周期与编排
- [[linear-attention-methods]] — 线性注意力方法 (Linear Attention Methods)
- [[llm-applications]] — LLM 应用
- [[llm-evaluation-benchmarks]] — LLM 评测基准体系
- [[long-context-understanding]] — 长上下文理解 (Long-Context Understanding)
- [[long-horizon-evaluation]] — Long-Horizon Evaluation / 长视界评估
- [[lost-in-the-middle]] — Lost in the Middle
- [[lovasz-local-lemma]] — Lovász Local Lemma
- [[lucas-penrose-argument]] — 卢卡斯-彭罗斯论证 (Lucas-Penrose Argument)
- [[mamba-ssm]] — Mamba (State Space Model)
- [[manifold-constrained-hyper-connections]] — Manifold-Constrained Hyper-Connections (mHC)
- [[math-question-reformulation]] — 数学问题多维度改写
- [[mathematical-pluralism]] — 数学多元主义 (Mathematical Pluralism)
- [[mathforge]] — MathForge 框架
- [[maze-navigation]] — 迷宫导航 (Maze Navigation)
- [[memory-caching-rnn]] — Memory Caching (MC)
- [[messy-context-reasoning]] — 混乱上下文推理 (Messy Context Reasoning)
- [[meta-jctrader]] — Meta-JCTrader
- [[meta-learning]] — Meta-Learning (元学习)
- [[metacognitive-self-modification]] — Metacognitive Self-Modification (元认知自我修改)
- [[metamathematics]] — 元数学 (Metamathematics)
- [[million-token-context]] — Million-Token Context
- [[mixture-of-attention-schemes]] — Mixture of Attention Schemes (MoAS)
- [[mixture-of-depths-attention]] — Mixture-of-Depths Attention (MoDA)
- [[mixture-of-experts]] — Mixture of Experts (MoE)
- [[mme-voke]] — MMEVOKE
- [[model-collapse-step]] — 模型崩溃步 (Model Collapse Step, MCS)
- [[model-harness-relationship]] — Model-Harness Relationship (模型与Harness关系)
- [[moe-lora]] — MoELoRA
- [[mqr]] — Multi-Aspect Question Reformulation (MQR)
- [[multi-agent-orchestration]] — Multi-Agent Orchestration多 Agent 编排
- [[multi-head-attention]] — Multi-Head Attention (MHA)
- [[multi-head-latent-attention]] — Multi-head Latent Attention (MLA)
- [[multi-hot-cross-entropy]] — Multi-hot Cross-Entropy (MCE)
- [[multi-query-attention]] — Multi-Query Attention (MQA)
- [[multi-solution-recovery]] — Multi-Solution Recovery多解恢复
- [[multi-token-prediction]] — Multi-Token Prediction (MTP)
- [[multi-trajectory-inference]] — Multi-Trajectory Inference多轨迹推理
- [[multimodal-large-language-model]] — 多模态大语言模型 (MLLM)
- [[multimodal-rag]] — 多模态 RAG (Multimodal RAG)
- [[muon-optimizer]] — Muon Optimizer
- [[native-sparse-attention]] — Native Sparse Attention (NSA)
- [[negative-sample-reinforcement]] — Negative Sample Reinforcement (NSR)
- [[neural-synchronization]] — Neural Synchronization as Representation
- [[neurida]] — NeurIDA
- [[neuron-level-models]] — Neuron-Level Models (NLMs)
- [[neuron-pairing]] — Neuron Pairing
- [[neuroscience]] — Neuroscience (神经科学)
- [[next-state-grounding]] — Next-State Grounding
- [[non-stationary-time-series]] — Non-stationary Time Series非平稳时间序列
- [[ntk-aware-interpolation]] — NTK-aware 位置编码插值
- [[null-space]] — 零空间 (Null Space)
- [[null-space-projection-knowledge]] — 零空间投影知识保留 (Null Space Projection for Knowledge Retention)
- [[observability]] — Observability & Operations可观测性与运维
- [[off-policy-llm-post-training]] — Off-Policy LLM 后训练
- [[on-policy-distillation]] — On-Policy Distillation (OPD)
- [[on-policy-learning-collapse]] — On-policy Learning Collapse
- [[optimal-gui-tool-path-selection]] — Optimal GUI-Tool Path Selection
- [[osworld-mcp]] — OSWorld-MCP Benchmark
- [[paley-graph]] — Paley Graph
- [[paris-harrington-theorem]] — Paris-Harrington Theorem巴黎-哈灵顿定理)
- [[pass-at-k-vs-pass-k]] — Pass@k vs Pass^k能力上限 vs 可靠性下限)
- [[path-tracing]] — 路径追踪 (Path Tracing)
- [[peano-arithmetic]] — 皮亚诺算术 (Peano Arithmetic, PA)
- [[perception-gap]] — 感知鸿沟 (Perception Gap)
- [[policy-reincarnation]] — Policy Reincarnation
- [[positive-sample-reinforcement]] — Positive Sample Reinforcement (PSR)
- [[post-train-space-rl]] — Post-train Space Reinforcement Learning
- [[practitioner-research-gap]] — Practitioner-Research Gap从业者-研究鸿沟)
- [[pre-activation-history]] — Pre-Activation History
- [[pre-train-space-reinforcement-learning]] — Pre-train Space Reinforcement Learning (PreRL)
- [[prefill-as-a-service]] — Prefill-as-a-Service (PrfaaS)
- [[prefill-decode-disaggregation]] — Prefill-Decode 分离架构 (PD Disaggregation)
- [[prefix-matching]] — Prefix Matching前缀匹配
- [[primitive-completeness]] — Primitive Completeness (原语完备性)
- [[primitive-recursive-functions]] — 原始递归函数 (Primitive Recursive Functions)
- [[probabilistic-method]] — Probabilistic Method概率方法
- [[procedural-task-execution]] — 程序性任务执行 (Procedural Task Execution)
- [[program-synthesis]] — Program Synthesis (程序合成)
- [[prompt-caching]] — Prompt Caching
- [[prompt-layering]] — Prompt Layering提示分层
- [[prompt-reverse-engineering]] — 图片反推 Prompt (Prompt Reverse Engineering)
- [[prompt-to-harness-evolution]] — Prompt-to-Harness Evolution三阶段工程演进
- [[query-intent-analyzer]] — Query Intent Analyzer
- [[question-quality-vs-quantity]] — Question Quality vs. Quantity问题质量 vs 数量)
- [[rag-systems]] — RAG 系统
- [[ramsey-context-cache]] — Ramsey Context Cache拉姆齐上下文缓存
- [[ramsey-context-graph]] — Ramsey Context Graph拉姆齐上下文图
- [[ramsey-context-template]] — Ramsey Context Template拉姆齐上下文模板
- [[ramsey-numbers]] — Ramsey Numbers拉姆齐数
- [[ramsey-theory]] — Ramsey Theory拉姆齐理论
- [[ramsey-theory-applications]] — Ramsey Theory Applications拉姆齐理论应用
- [[random-access-binding]] — Random-Access Binding (随机访问绑定)
- [[random-graph-theory]] — Random Graph Theory随机图理论
- [[real-life-context-learning]] — 真实生活上下文学习 (Real-Life Context Learning)
- [[rectified-flows]] — Rectified Flows
- [[recursive-reasoning-models]] — Recursive Reasoning Models递归推理模型
- [[recursive-self-improvement]] — Recursive Self-Improvement (递归自我改进)
- [[reference-gap]] — 引用鸿沟 (Reference Gap)
- [[reinforcement-learning-trading]] — Reinforcement Learning Trading强化学习交易
- [[rejected-edit-buffer]] — Rejected-Edit Buffer (拒绝编辑缓冲)
- [[relational-graph]] — Relational Graph
- [[reliable-state-long-running-agents]] — Reliable State in Long-Running Agents长期运行中的可靠状态
- [[replay-buffer-rl-llm]] — Replay Buffer 在 LLM RL 中的应用
- [[representation-alignment]] — Representation Alignment
- [[reverse-proxy-authentication]] — 反向代理认证 (Reverse Proxy Authentication)
- [[reward-hacking-llm]] — LLM 奖励黑客 (Reward Hacking in LLMs)
- [[reward-model]] — 奖励模型 (Reward Model, RM)
- [[reward-recency-sampling]] — 奖励-最近度混合采样
- [[risograph-print-style]] — Riso 印刷风格 (Risograph Print Style)
- [[rlvr-unified-framework]] — RLVR 统一理论框架
- [[rolling-kv-cache]] — 滚动 KV 缓存 (Rolling KV Cache)
- [[rotary-position-embedding]] — 旋转位置编码 (RoPE)
- [[round-trip-reconstruction-score]] — Round-Trip Reconstruction Score (RS@k)
- [[rule-system-application]] — 规则系统应用 (Rule System Application)
- [[russells-paradox]] — 罗素悖论 (Russell's Paradox)
- [[russian-constructivism]] — 俄国构成主义 (Russian Constructivism)
- [[s-token]] — S-Token (Superposed Token)
- [[sde-sampler-language]] — SDE Sampler for Language Diffusion
- [[searcher-trainer-decoupling]] — Searcher-Trainer 解耦架构
- [[secure-containers]] — 安全容器
- [[seer-attention]] — SeerAttention
- [[self-conditioning]] — Self-Conditioning
- [[self-evolving-agents]] — Self-Evolving Agents (自进化 Agent)
- [[self-evolving-benchmark]] — 自进化基准 (Self-Evolving Benchmark)
- [[self-improving-ai]] — Self-Improving AI (自我改进人工智能)
- [[self-reference]] — 自指 (Self-Reference)
- [[self-verification-rewards]] — 自我验证奖励 (Self-Verification Rewards)
- [[semantic-equivalence]] — Semantic Equivalence / 语义等价
- [[shadow-calling]] — Shadow Calling (影子调用)
- [[shared-parameter-influence]] — Shared Parameter Influence
- [[shared-weight-discretization]] — Shared-Weight Discretization
- [[singularity]] — Singularity (奇点)
- [[sink-token]] — 汇 Token (Sink Token)
- [[skill-as-external-state]] — Skill as External State (Skill 作为外部状态)
- [[skill-data-flywheel]] — Skill Data Flywheel (Skill 数据飞轮)
- [[skill-ecosystem]] — Skill Ecosystem (Skill 生态系统)
- [[skillopt]] — SkillOpt
- [[slow-meta-update]] — Slow/Meta Update (慢/元更新)
- [[softmax-off-by-one]] — SoftMax-off-by-One
- [[sparse-attention-patterns]] — 稀疏注意力模式 (Sparse Attention Patterns)
- [[specialist-training-pipeline]] — Specialist Training Pipeline
- [[specialized-rl]] — 专项强化学习 (Specialized RL)
- [[specialized-sft]] — 专项监督微调 (Specialized SFT)
- [[spiking-neural-networks]] — Spiking Neural Networks (SNN)
- [[spurious-predictability]] — Spurious Predictability
- [[stage-matched-data-config]] — Stage-Matched Data Configuration (分阶段数据配置)
- [[standard-agent-handoffs]] — Standard Agent Handoffs标准化 Agent 交接)
- [[staug]] — STAug (EMD-based Augmentation)
- [[stochastic-latent-trajectory]] — Stochastic Latent Trajectory随机潜在轨迹
- [[strategy-engineering-unification]] — Strategy-Engineering Unification (策略与工程统一)
- [[structured-knowledge]] — 结构化知识 (Structured Knowledge)
- [[stub-pattern]] — Stub Pattern轻量化桩模式
- [[subquadratic-transformer-alternatives]] — 次二次 Transformer 替代方案
- [[sufficient-context-paradox]] — 充分上下文悖论 (Sufficient Context Paradox)
- [[swe-bench]] — SWE-bench
- [[symbolic-backpropagation]] — Symbolic Back-Propagation (符号反向传播)
- [[symbolic-network]] — Symbolic Network (符号网络)
- [[symbolic-regression]] — Symbolic Regression
- [[synapse-model]] — Synapse Model
- [[synthetic-data-qa-generation]] — Synthetic Data QA Generation (合成数据Q&A生成)
- [[system-2-thinking]] — System 2 思维
- [[system-message-abuse]] — System Message Abuse系统消息滥用
- [[szemerédi-regularity-lemma]] — Szemerédi Regularity Lemma
- [[tabular-foundation-models]] — Tabular Foundation Models
- [[tba]] — Trajectory Balance with Asynchrony (TBA)
- [[temporal-decay-neural]] — Temporal Decay (Neural)
- [[temporal-patch-shuffle]] — Temporal Patch Shuffle (TPS)
- [[terminal-bench]] — Terminal-Bench
- [[test-time-scaling]] — Test-Time Scaling
- [[test-time-training-rl]] — 测试时训练 RL (Test-Time Training with RL)
- [[text-space-optimizer]] — Text-Space Optimizer (文本空间优化器)
- [[text-vs-weight-optimization]] — Text vs Weight Optimization (文本 vs 权重优化)
- [[textual-learning-rate]] — Textual Learning Rate (文本学习率)
- [[thompson-sampling-code-search]] — Thompson Sampling Code Search
- [[three-engineering-phases]] — Three Engineering Phases三阶段工程演进
- [[throughput-hypothesis]] — Throughput Hypothesis (吞吐量假说)
- [[time-series-forecasting-augmentation]] — Time Series Forecasting Augmentation
- [[time-variant-dynamics]] — Time-variant Dynamics时变动力学
- [[token-efficiency]] — Token 效率 (Token Efficiency)
- [[token-superposition-training]] — Token Superposition Training (TST)
- [[tool-bootstrapped-rft]] — Tool-Bootstrapped GUI RFT
- [[tool-efficient-path-reward]] — Tool-Efficient Path Reward
- [[tool-interface]] — Tool Interface & Protocol Layer工具接口与协议层
- [[tool-registry]] — ToolRegistry
- [[trace-native-evaluation]] — Trace-Native Evaluation踪迹原生评估
- [[trading-lifecycle-driven-eviction]] — Trading-Lifecycle Driven Eviction (交易生命周期驱动淘汰)
- [[trajectory-balance-objective]] — Trajectory Balance (TB) 目标
- [[transfer-learning]] — Transfer Learning (迁移学习)
- [[two-phase-pretraining]] — Two-Phase Pre-Training
- [[ultradata]] — UltraData
- [[unconditional-generation-latent]] — Unconditional Generation via Latent Reasoning
- [[unified-rft]] — 统一拒绝采样微调 (Unified RFT)
- [[unsupervised-rlvr]] — 无监督可验证奖励强化学习 (URLVR)
- [[update-magnitude-imbalance]] — GRPO 更新幅度不平衡
- [[userspace-kernel]] — 用户空间内核
- [[van-der-waerden-theorem]] — van der Waerden Theorem
- [[verification-evaluation]] — Verification & Evaluation验证与评估
- [[visual-primitives]] — 视觉原语 (Visual Primitives)
- [[wavemask-wavemix]] — WaveMask / WaveMix
- [[width-based-scaling]] — Width-Based Scaling宽度扩展
- [[window-attention]] — 窗口注意力 (Window Attention)
- [[worst-case-threat-model]] — 最坏情况威胁模型
- [[x-prediction-parameterization]] — x-Prediction Parameterization
- [[zero-cost-proxies]] — Zero-Cost Proxies (ZCP)
## Papers
- [[behrouz-memory-caching-rnn]]
- [[clawless-ai-agent-security]]
- [[darlow-ctm-2025]] — CTM: 以神经同步为表示的持续思考机器 (NeurIPS 2025)
- [[deepseek-v4-million-token-context]]
- [[dou-cl-bench]]
- [[elf-embedded-language-flows]] — ELF: 连续嵌入空间中的 Flow Matching 语言扩散模型 (2026)
- [[godel-incompleteness-tutorial]]
- [[he-urlvr-sharpening-2026]]
- [[hunyuan-team-cl-bench-life]]
- [[laban-llms-corrupt-documents-delegate]] — "LLMs Corrupt Your Documents When You Delegate" — DELEGATE-52
- [[li-amd-human-perception]]
- [[liu-koopa-2023]]
- [[llm-attention-survey-2026]]
- [[nikolopoulos-spurious-predictability]]
- [[odrzywolek-eml-single-operator]]
- [[qin-prfaas-cross-datacenter]]
- [[ramsey-numbers-survey]]
- [[song-agent-network-taxonomy]]
- [[streaming-llm]] — StreamingLLM: 基于注意力汇的无限长流式语言模型推理框架 (ICLR 2024)
- [[tao-klowden-ai-mathematical-methods]]
- [[thinking-with-visual-primitives]]
- [[xing-trails-2024]] — Trails: 数据库原生的深度神经网络模型选择 (VLDB 2024)
- [[zeng-dynamic-model-slicing-2024]] — 数据库内的动态模型切片技术 (VLDB 2024)
- [[zeng-neurida-2025]] — NeurIDA: 动态库内建模实现有效的关系数据库分析
- [[zhang-hyperagents]]
- [[zhao-neurdb-2025]] — NeurDB: AI 驱动的自主数据库 (CIDR 2025)
- [[zhu-moda-mixture-of-depths]]
- [[agarwal-bayesian-attention-geometry]] — The Bayesian Geometry of Transformer Attention
- [[agent-harness-engineering-survey]] — Agent Harness Engineering: A Survey
- [[bartoldson-tba-2025]] — TBA: 异步轨迹平衡 — 解耦探索与学习以实现快速可扩展的 LLM 后训练
- [[behrouz-memory-caching-rnn]] — Memory Caching: RNNs with Growing Memory
- [[clawless-ai-agent-security]] — ClawLess: AI 代理安全模型
- [[dai-mathforge-2026]] — MathForge: Harder Is Better — 难度感知GRPO与多维度问题改写
- [[darlow-ctm-2025]] — Continuous Thought Machines (CTM)
- [[deepseek-v4-million-token-context]] — DeepSeek-V4: 迈向高效百万 Token 上下文智能
- [[dou-cl-bench]] — CL-bench: 上下文学习基准——首篇定义context learning范式的论文
- [[elf-embedded-language-flows]] — ELF: Embedded Language Flows
- [[godel-incompleteness-tutorial]] — 哥德尔不完备定理教程
- [[gram-generative-recursive-reasoning-paper]] — Generative Recursive Reasoning (GRAM)
- [[he-urlvr-sharpening-2026]] — How Far Can Unsupervised RLVR Scale LLM Training?
- [[hunyuan-team-cl-bench-life]] — CL-Bench Life: 真实生活上下文学习基准
- [[kore-knowledge-injection]] — KORE: Knowledge-Oriented Controls for Knowledge Injection
- [[laban-llms-corrupt-documents-delegate]] — LLMs Corrupt Your Documents When You Delegate
- [[li-amd-human-perception]] — "Are You Sure?": Human Perception Vulnerability in LLM Agents
- [[liu-koopa-2023]] — Koopa: Koopman 预测器驱动的非平稳时间序列学习
- [[llm-attention-survey-2026]] — 大语言模型注意力机制全面分析
- [[lou-autoharness-2026]] — AutoHarness: LLM Agent 的自动代码 Harness 合成
- [[nikolopoulos-spurious-predictability]] — Spurious Predictability in Financial Machine Learning
- [[odrzywolek-eml-single-operator]] — All elementary functions from a single binary operator
- [[peng-tst-2026]] — Token Superposition Training: 高效 LLM 预训练的 Token 叠加方法
- [[pre-train-space-reinforcement-learning]] — Pre-train Space Reinforcement Learning (PreRL/DSRL)
- [[qin-prfaas-cross-datacenter]] — Prefill-as-a-Service: KVCache Goes Cross-Datacenter
- [[ramsey-numbers-survey]] — 拉姆齐数的数学综述
- [[song-agent-network-taxonomy]] — Complex networks of AI agentic systems: 拓扑-记忆-更新三层分类法
- [[streaming-llm]] — StreamingLLM: 基于注意力汇的高效流式语言模型
- [[tao-klowden-ai-mathematical-methods]] — Mathematical methods and human thought in the age of AI
- [[thinking-with-visual-primitives]] — Thinking with Visual Primitives — 以视觉原语思考
- [[toolcua-optimal-gui-tool-orchestration]] — ToolCUA: Optimal GUI-Tool Path Orchestration for Computer Use Agents
- [[when-large-multimodal-models-confront-evolving-knowledge]] — When Large Multimodal Models Confront Evolving Knowledge
- [[xing-trails-2024]] — Trails: Database Native Model Selection (VLDB 2024)
- [[yang-skillopt-2026]] — SkillOpt: Agent Skill 的文本空间优化器
- [[zeng-dynamic-model-slicing-2024]] — Powering In-Database Dynamic Model Slicing for Structured Data Analytics (VLDB 2
- [[zeng-neurida-2025]] — NeurIDA: Dynamic Modeling for Effective In-Database Analytics
- [[zhang-hyperagents]] — Hyperagents: Self-Referential Agents with Metacognitive Self-Modification
- [[zhao-neurdb-2025]] — NeurDB: On the Design and Implementation of an AI-powered Autonomous Database (C
- [[zhou-agent-symbolic-learning-2024]] — Agent Symbolic Learning: 用符号学习实现自进化 Agent
- [[zhu-moda-mixture-of-depths]] — Mixture-of-Depths Attention (MoDA)
## Articles
- [[caddy-reverse-proxy-auth]] — Caddy 反向代理认证方案
- [[claw-eval]] — Claw-Eval面向自主Agent的端到端评测框架
- [[crawl4ai-open-source-web-crawler]] — Crawl4AI赋能AI用户的开源智能网页爬虫与数据提取工具
- [[distributed-agent-cache-sync-2026]] — 分布式Agent缓存同步从单机到多机的Prompt Caching架构升级
- [[gpt-image2-prompt-collection]] — GPT-Image-2 绘图 Prompt 方法论与风格合集
- [[lyu-model-harness-evolution-2026]] — Model与Harness的关系演进从AutoHarness到Heuristic Learning
- [[lyu-skillopt-deep-dive-2026]] — SkillOpt深度解读自进化Agent技能的'反向传播'与工程化Continued Evolve
- [[mini-agent-harness]] — 从零搭建 Mini Agent Harness
- [[oppo-multimodal-data-lake]] — OPPO 多模态数据湖架构实践
- [[prompt-caching-architecture]] — Prompt Caching 架构工程手册
- [[ramsey-context-construction]] — 上下文构造与拉姆齐数
- [[temporal-patch-shuffle-tps]] — 时序预测增强方法综述:从频域到 TPS
- [[ultradata-l3-open-source-2026]] — UltraData面壁智能L3数据开源与数据分级治理体系
- [[caddy-reverse-proxy-auth]]
- [[crawl4ai-open-source-web-crawler]]
- [[gpt-image2-prompt-collection]]
- [[oppo-multimodal-data-lake]]
- [[prompt-caching-architecture]]
- [[ramsey-context-construction]]
## Special Pages
- [[SCHEMA]] — Wiki 结构规范
- [[log]] — 变更日志
- [[README]] — Wiki 说明
- [[index]] —
- [[log]] —
- [[README]] —
- [[SCHEMA]] —
## Reviews
- [[ctm-review-20260515]] — CTM 论文集成 Review (2026-05-15)
- [[toolcua-review-20260531]] — ToolCUA Review: GUI-Tool路径编排的概念网络分析
- [[agent-harness-engineering-review-20260523]] — Review: Agent Harness Engineering Survey
- [[agent-network-taxonomy-review-20260501]] — Agent网络三层分类法 — Review 报告
- [[cl-bench-life-review-20260501]] — CL-Bench Life 论文集成 Review
- [[cl-bench-review-20260501]] — CL-bench 论文集成 Review
- [[clawless-review-20260422]] — ClawLess: AI 代理安全模型 - Review 报告
- [[ctm-review-20260515]] — Continuous Thought Machines 论文集成 Review
- [[delegate52-review-20260514]] — DELEGATE-52 Review
- [[distributed-agent-cache-sync-review]] — Review: 分布式Agent缓存同步
- [[elf-embedded-language-flows-review-20260513]] — Review: ELF — Embedded Language Flows
- [[godel-tutorial-review-20260428]] — 哥德尔不完备定理教程 — Review 报告
- [[hyperagents-review-20260420]] — 📚 Wiki 添加 Review 报告 - Hyperagents 论文
- [[koopa-review-20260511]] — Review: Koopa — Koopman 预测器驱动的非平稳时序学习
- [[kore-review-20260521]] — KORE Review
- [[llm-attention-survey-review-20260429]] — Review: 大语言模型注意力机制全面分析
- [[lou-autoharness-review]] — Review: AutoHarness — 自动合成代码 Harness 改进 LLM Agent
- [[lyu-model-harness-review]] — Review: Model与Harness的关系演进
- [[lyu-skillopt-deep-dive-review]] — Review: SkillOpt深度解读 — 自进化Agent的'反向传播'
- [[mathforge-review-20260512]] — MathForge Review — 2026-05-12
- [[neurida-review-20260515]] — NeurIDA 论文集成 Review
- [[peng-tst-2026-review]] — Review: Token Superposition Training
- [[pretrain-space-rl-review-20260518]] — Review: Pre-train Space Reinforcement Learning
- [[prompt-caching-architecture-review-20260511]] — Review: Prompt Caching 架构工程手册
- [[ramsey-context-construction-review-20260511]] — Review: 上下文构造与拉姆齐数
- [[ramsey-numbers-survey-review-20260511]] — Review: 拉姆齐数的数学综述
- [[streaming-llm-review-20260514]] — Review: StreamingLLM — 基于注意力汇的无限长流式语言模型
- [[tba-review-20260512]] — TBA Review — 2026-05-12
- [[thinking-with-visual-primitives-review-20260430]] — Review — Thinking with Visual Primitives
- [[ultradata-l3-review]] — Review: UltraData — 大模型数据分级治理的开源实践
- [[yang-skillopt-review]] — Review: SkillOpt — Agent Skill 的文本空间优化器
- [[zhou-agent-symbolic-learning-review]] — Review: Agent Symbolic Learning — 符号学习驱动的自进化Agent