myWiki/index.md

# LLM Wiki

> 知识索引页面 — 自动生成
> 最后更新：2026-06-25 | 总页面数：1249

## Concepts

- [[4d-gaussian-splatting]] — 4D 高斯泼溅 (4D Gaussian Splatting)
- [[abductive-reasoning-recommendation]] — 溯因推理 (推荐) — Abductive Reasoning in Recommendation
- [[absolute-gating]] — 绝对门控与相对门控 (Absolute vs Relative Gating)
- [[abstract-representation-space]] — 抽象表征空间 (Abstract Representation Space)
- [[ace-router]] — ACE-Router — 训练专用路由器
- [[action-applicability]] — Action Applicability (动作合法性判定)
- [[action-consequence-prediction]] — 预测行动后果 (Action Consequence Prediction)
- [[action-decoder]] — 动作解码器 (Action Decoder)
- [[action-head-router]] — 动作头路由器 (Action Head Router)
- [[action-realization-layer]] — Action Realization Layer（动作实现层）
- [[action-routing-policy]] — 动作路由策略 (Action-Routing Policy)
- [[activation-manifold]] — Activation Manifold
- [[activation-steering]] — Activation Steering
- [[active-cache-warmup]] — Active Cache Warm-up (主动缓存预热)
- [[active-tool-discovery]] — 主动工具发现 — Active Tool Discovery
- [[active-tool-request]] — Active Tool Request — 结构化工具请求
- [[adapter-protocol]] — 适配器协议 (Adapter Protocol)
- [[adaptive-adversary]] — 自适应对手 (Adaptive Adversary)
- [[adaptive-computation-time]] — Adaptive Computation Time (ACT)
- [[adaptive-harness-simplification]] — Adaptive Harness Simplification（自适应 Harness 简化）
- [[additive-combinatorics]] — Additive Combinatorics（加法组合学）
- [[adkv]] — AdaKV
- [[agent-capability-stability-gap]] — Agent Capability-Stability Gap（能力-稳定性差距）
- [[agent-communication-stack]] — Agent通信协议栈
- [[agent-completion-evaluation]] — Agent Completion Evaluation（Agent 完成度评测）
- [[agent-computer-interface]] — Agent-Computer Interface (ACI)
- [[agent-eval-case-design]] — Agent Eval Case Design
- [[agent-eval-grader]] — Agent Eval Grader
- [[agent-eval-trace]] — Agent Eval Trace
- [[agent-evaluation-paradigm-shift]] — Agent 评测范式转变（Paradigm Shift in Agent Evaluation）
- [[agent-frameworks-to-platforms]] — Agent Frameworks to Platforms（从 Agent 框架到 Agent 平台）
- [[agent-governance]] — Agent Governance（Agent 治理与安全）
- [[agent-harness]] — Agent Harness (Claw)
- [[agent-harness-engineering]] — Agent Harness Engineering（Agent 执行骨架工程）
- [[agent-harness-mini]] — Mini Agent Harness
- [[agent-harness-safety]] — Agent Harness Safety
- [[agent-mediated-deception]] — 代理中介欺骗 (Agent-Mediated Deception)
- [[agent-memory-five-category-model]] — Agent Memory Five-Category Model (sz 设计)
- [[agent-memory-lifecycle]] — Agent 记忆生命周期
- [[agent-memory-system]] — Agent 记忆系统
- [[agent-memory-taxonomy]] — Agent Memory Taxonomy (三索引分型)
- [[agent-multidimensional-capability]] — Agent Multidimensional Capability（Agent 多维能力）
- [[agent-network-memory-scope]] — Agent网络记忆范围
- [[agent-network-taxonomy]] — Agent网络三层分类法
- [[agent-network-topology]] — Agent网络拓扑
- [[agent-network-update-behavior]] — Agent网络更新行为
- [[agent-observability]] — Agent Observability（Agent 可观测性）
- [[agent-process-evaluation]] — Agent Process Evaluation（过程评测）
- [[agent-robustness-evaluation]] — Agent Robustness Evaluation（Agent 鲁棒性评测）
- [[agent-safety-evaluation]] — Agent Safety Evaluation（Agent 安全评测）
- [[agent-sandbox]] — Agent Sandbox（Agent 沙箱）
- [[agent-skill]] — Agent Skill — 可复用过程性构件
- [[agent-skill-atomization]] — Agent Skill 原子化
- [[agent-skill-ecosystem]] — Agent Skill 生态系统
- [[agent-symbolic-learning]] — Agent Symbolic Learning (Agent 符号学习)
- [[agent-token-budget-optimization]] — Agent Token Budget Optimization
- [[agent-verification]] — Agent Verification（Agent 验证与评估）
- [[agent-web]] — Agent Web — 开放协作智能体网络
- [[agentic-cache-manager]] — Agentic Cache Manager
- [[agentic-rag]] — Agentic RAG
- [[agentic-streaming-inference]] — Agentic Streaming Inference
- [[agentic-systems]] — Agentic Systems（智能体系统）
- [[agi-critique]] — AGI 批判（AGI Critique）
- [[ai-agent-security]] — AI代理安全
- [[ai-alignment]] — AI Alignment (AI对齐)
- [[ai-mathematics]] — AI and Mathematics (AI 与数学)
- [[ai-production-tradeoffs]] — AI 生产权衡 — 六大维度
- [[ai-safety]] — AI Safety (AI安全)
- [[aidb]] — AIDB（大模型友好数据层）
- [[aleatoric-uncertainty]] — 随机不确定性 (Aleatoric Uncertainty)
- [[algebraic-numbers-countability]] — 代数数的可数性
- [[algorithmic-equity]] — 算法公平性 (Algorithmic Equity)
- [[amortized-variational-inference]] — Amortized Variational Inference（摊销变分推断）
- [[analytical-report-synthesizer]] — Analytical Report Synthesizer
- [[and-or-interactions]] — AND-OR 交互 (AND-OR Interactions)
- [[anthropic-agent-evals]] — Anthropic Agent Evals
- [[anthropomorphization-critique]] — 人类化机器批判（Anthropomorphization Critique）
- [[api-key-authentication]] — API Key 认证 (API Key Authentication)
- [[appearance-bias-vla]] — Appearance Bias in VLA
- [[arxiv]] — arXiv
- [[asymmetric-grounding-adherence-loss]] — Asymmetric Grounding Adherence Loss (L_AGA)
- [[asynchronous-rl-llm]] — 异步强化学习与大语言模型后训练
- [[atlas-memory-system]] — Atlas Memory System
- [[attention-entropy-collapse]] — 注意力熵崩溃 (Attention Entropy Collapse)
- [[attention-mechanism]] — Attention Mechanism
- [[attention-sinks]] — 注意力汇 (Attention Sinks)
- [[attractor-dynamics]] — 吸引子动力学 (Attractor Dynamics)
- [[audio-visual-generation]] — Audio-Visual Generation
- [[audio-visual-representation-alignment]] — Audio-Visual Representation Alignment
- [[autoharness]] — AutoHarness
- [[automated-theorem-proving]] — 自动定理证明 (Automated Theorem Proving, ATP)
- [[automatic-prompt-optimization]] — APO 自动提示工程 (Automatic Prompt Optimization)
- [[autonomous-optimization-ao]] — Autonomous Optimization (AO)
- [[autoregressive-unrolling]] — 自回归展开 (Autoregressive Unrolling)
- [[autoregressive-video-generation]] — Autoregressive Video Generation
- [[auxiliary-predictive-objectives]] — 辅助预测目标 (Auxiliary Predictive Objectives)
- [[backtranslation-round-trip-relay]] — Backtranslation Round-Trip Relay
- [[banach-space]] — Banach 空间 (Banach Space)
- [[bare-adapter]] — Bare Adapter
- [[barker-gibbs]] — Barker Gibbs
- [[base-table-embedding]] — Base Table Embedding
- [[bastiani-calculus]] — Bastiani 微积分 (Bastiani Calculus)
- [[batch-vs-real-time-inference]] — 批处理推理 vs 实时推理
- [[bayesian-attention-geometry]] — Bayesian Attention Geometry (贝叶斯注意力几何)
- [[bayesian-attention-trilogy]] — Bayesian Attention Trilogy
- [[bayesian-deep-learning]] — 贝叶斯深度学习 (Bayesian Deep Learning)
- [[bayesian-filtering]] — 贝叶斯滤波
- [[bayesian-nonparametric-tpp]] — 贝叶斯非参数 TPP (Bayesian Nonparametric TPP)
- [[bayesian-wind-tunnels]] — Bayesian Wind Tunnels
- [[belief-accumulation]] — Belief Accumulation (信念累积)
- [[belief-state]] — 信念状态 (Belief State)
- [[belief-transport]] — Belief Transport (信念传输)
- [[bellman-taylor-score-decoding]] — Bellman-Taylor 得分解码 (BTSD)
- [[bidirectional-trajectory-evaluation]] — 双向轨迹评估 (Bidirectional Trajectory Evaluation)
- [[binding-constraint-thesis]] — Binding-Constraint Thesis（约束瓶颈论）
- [[block-causal-attention]] — Block-Causal Attention
- [[block-sparse-attention]] — Block-Sparse Attention Mask (分块稀疏注意力掩码)
- [[bm25-financial-retrieval]] — BM25 金融检索
- [[boundary-compliance]] — Boundary Compliance
- [[bounded-reuse]] — 有界复用 (Bounded Reuse)
- [[bpf-syscall-interception]] — BPF系统调用拦截
- [[btsd-ppo]] — BTSD-PPO
- [[build-vs-buy-llm]] — 构建 vs 购买 — Build vs Buy (LLM)
- [[bypass-network-handle-distribution]] — Bypass Network Handle Distribution (旁路网络句柄分发)
- [[cace-principle]] — CACE 原理 — Change Anything Changes Everything
- [[cache-cold-start]] — Cache Cold-Start (缓存冷启动)
- [[cache-health-observability]] — Cache Health Observability（缓存健康度可观测性）
- [[cache-hit-ratio]] — Cache Hit Ratio (CHR)
- [[cache-invalidation]] — Cache Invalidation（缓存失效）
- [[cache-safe-forking]] — Cache-Safe Forking（缓存安全分叉）
- [[caddy-web-server]] — Caddy Web Server
- [[candidate-graph]] — 候选图 — Candidate Graph
- [[capability-control-tradeoff]] — Capability-Control Tradeoff（能力-控制权衡）
- [[capability-degradation]] — 能力退化 (Capability Degradation)
- [[catastrophic-forgetting]] — 灾难性遗忘 (Catastrophic Forgetting)
- [[causal-decomposition-pomg]] — 因果分解 (Causal Decomposition in POMG)
- [[causal-generation]] — Causal Generation
- [[causal-information-flow]] — Causal Information Flow
- [[causal-multimodal-vae]] — Causal Multimodal VAE
- [[cel-shading-style]] — 赛璐璐风格 (Cel-Shading)
- [[center-manifold-theorem]] — Center Manifold Theorem (中心流形定理)
- [[centralized-agent-architecture]] — 集中式Agent架构
- [[certainty-based-loss]] — Certainty-Based Loss
- [[certainty-based-rewards]] — 确定性奖励 (Certainty-Based Rewards)
- [[chain-of-thought]] — 思维链 (Chain-of-Thought, CoT)
- [[chaitin-algorithmic-information-theory]] — 算法信息论 (Algorithmic Information Theory, AIT)
- [[chaitin-constant]] — 蔡廷常数 Ω (Chaitin's Constant)
- [[cl-bench-life]] — CL-Bench Life
- [[classifier-free-guidance-language]] — Classifier-Free Guidance for Language
- [[claw-swe-bench-lite]] — Claw-SWE-Bench Lite
- [[clawforce]] — ClawForce — 企业 AI Agent 方案
- [[clawless]] — ClawLess
- [[clean-conditioning-mask]] — 清洁条件掩码 (Clean-Conditioning Mask)
- [[clinical-ai]] — 临床人工智能 (Clinical AI)
- [[coarse-grained-counting]] — 粗粒度计数 (Coarse-grained Counting)
- [[coarse-grained-recurrence]] — 粗粒度循环 (Coarse-Grained Recurrence)
- [[coarse-to-fine-granularity]] — Coarse-to-Fine Granularity
- [[coconut]] — COCONUT: 连续潜空间推理
- [[code-as-harness]] — Code as Harness
- [[cognitive-architecture]] — Cognitive Architecture (认知架构)
- [[collectivist-ai]] — 集体主义 AI（Collectivist AI）
- [[compiled-ai-paradigm]] — Compiled AI Paradigm (编译型 AI 范式)
- [[completeness-logic]] — 完备性 (Completeness, 逻辑学)
- [[composable-base-model-architecture]] — Composable Base Model Architecture
- [[compressed-sparse-attention]] — Compressed Sparse Attention (CSA)
- [[computability-theory]] — 可计算性理论 (Computability Theory)
- [[computer-use-agents]] — Computer Use Agents (CUAs)
- [[computerized-adaptive-testing]] — Computerized Adaptive Testing (CAT)
- [[concept-lattice]] — 概念格 (Concept Lattice)
- [[concept-learning]] — 概念学习：几何视角 (Concept Learning: Geometric View)
- [[conditional-intensity-function]] — 条件强度函数 (Conditional Intensity Function)
- [[conditional-memory]] — Conditional Memory
- [[conditional-model-dispatcher]] — Conditional Model Dispatcher
- [[confidence-correctness-alignment]] — 置信度-正确性对齐 (Confidence-Correctness Alignment)
- [[consistency-logic]] — 一致性 (Consistency, 逻辑学)
- [[constant-kv-cache]] — Constant KV Cache
- [[content-based-reasoning]] — Content-Based Reasoning
- [[content-diversity-decay]] — 内容多样性衰减（Content Diversity Decay）
- [[content-grounded-retrieval]] — Content-Grounded Retrieval — Faithfulness as First Principle
- [[content-homogenization]] — 内容同质化（Content Homogenization）
- [[content-question-answering]] — Content Question Answering (CQA)
- [[context-anchoring]] — 历史上下文锚定（Context Anchoring）
- [[context-blue-clique]] — Context Blue Clique（上下文蓝色团）
- [[context-compression]] — Context Compression（上下文压缩）
- [[context-drift]] — Context Drift（上下文漂移）
- [[context-engineering]] — Context Engineering（上下文工程）
- [[context-enriched-embeddings]] — 上下文增强嵌入 — Context Enriched Embeddings
- [[context-learning]] — 上下文学习 (Context Learning)
- [[context-management]] — Context Management（上下文管理）
- [[context-misuse]] — 上下文误用 (Context Misuse)
- [[context-pruning]] — Context Pruning (上下文剪枝)
- [[context-state-estimation]] — Context as State Estimation（上下文作为状态估计）
- [[continual-learning]] — 持续学习 (Continual Learning)
- [[continuation-value-function]] — 延续价值函数 (Continuation Value Function)
- [[continuous-diffusion-language-models]] — Continuous Diffusion Language Models
- [[continuous-representation]] — 连续表征 (Continuous Representation)
- [[continuous-thought-machine]] — Continuous Thought Machine (CTM)
- [[continuous-time-rl]] — 连续时间强化学习 (Continuous-Time RL)
- [[continuum-hypothesis]] — 连续统假设 (Continuum Hypothesis, CH)
- [[control-affine-mdp]] — 控制仿射 MDP (Control-Affine MDP)
- [[controlled-autonomy]] — Controlled Autonomy (受控的自主性)
- [[controlled-text-generation]] — Controlled Text Generation
- [[convex-hull-relaxation]] — Convex-Hull Relaxation (KV Cache)
- [[coordinator-executor-architecture]] — Coordinator-Executor Architecture
- [[cost-aware-benchmarking]] — 代价感知基准评测 (Cost-Aware Benchmarking)
- [[cost-quality-speed-trilemma]] — Cost-Quality-Speed Trilemma（成本-质量-速度三元悖论）
- [[countable-uncountable-infinity]] — 可数与不可数无穷
- [[covariance-matrix]] — 协方差矩阵 (Covariance Matrix)
- [[covariance-matrix-knowledge]] — 协方差矩阵知识存储 (Covariance Matrix Knowledge Storage)
- [[cramer-rao-lower-bound]] — Cramér-Rao Lower Bound (CRLB)
- [[crawl4ai]] — Crawl4AI
- [[critical-failures]] — Critical Failures / 关键失败
- [[critpt]] — CritPt (Critical Point Benchmark)
- [[cross-head-budget-allocation]] — Cross-Head Budget Allocation
- [[cross-model-harness-transfer]] — Cross-Model Harness Transfer（跨模型 Harness 迁移）
- [[cross-section-synthesis]] — Cross-Section Synthesis — Information Integration Across Document Parts
- [[curvine-distributed-cache]] — Curvine 云原生分布式缓存
- [[dag-reasoning-evaluation]] — DAG-based Reasoning Evaluation
- [[darwin-godel-machine]] — Darwin Gödel Machine (达尔文·哥德尔机)
- [[data-augmentation]] — 数据增强 (Data Augmentation)
- [[data-hierarchical-governance]] — Data Hierarchical Governance (L0-L4 数据分级治理)
- [[data-label-consistency]] — Data-Label Consistency (数据-标签一致性)
- [[data-markets]] — 数据市场（Data Markets）
- [[data-quality-over-scale]] — Data Quality over Scale (数据质量重于规模)
- [[data-quality-vs-quantity]] — 数据数量 vs 数据质量
- [[data-replay]] — 数据回放 (Data Replay)
- [[data-slice]] — Data Slice
- [[data-swamp]] — 数据沼泽 — Data Swamp
- [[data-wall]] — 数据墙 (Data Wall)
- [[dcgwm]] — DCGWM (双通道接地世界建模)
- [[ddcadam]] — DDCAdam (Dead-Direction-Calibrated Adam)
- [[dead-direction]] — 死方向 (Dead Direction)
- [[decentralized-agent-architecture]] — 去中心化Agent架构
- [[deep-and-wide-reasoning]] — Deep-and-Wide Reasoning（深度且宽广的推理）
- [[deep-gaussian-process]] — 深度高斯过程 (Deep Gaussian Process)
- [[deep-rl-scaling]] — 扩展深度强化学习 (Scaling Deep RL)
- [[deep-thinking-sft]] — Deep-Thinking SFT (深思考SFT数据)
- [[deep-variational-implicit-process]] — 深度变分隐式过程 (DVIP)
- [[deepencoder]] — DeepEncoder
- [[deepseek-ocr]] — DeepSeek OCR
- [[deepseek-r1]] — DeepSeek-R1
- [[deepseek-v4-flash]] — DeepSeek-V4-Flash
- [[deepseek-vit]] — DeepSeek-ViT
- [[default-tools]] — Default Tools — 始终可用的通用工具
- [[delegate-52]] — DELEGATE-52
- [[delegated-work]] — Delegated Work / 委托工作
- [[delta-rule]] — Delta Rule
- [[depth-dilemma]] — 深度困境 (Depth Dilemma)
- [[depth-recurrence]] — 深度循环 (Depth Recurrence)
- [[depth-scaling-signal-degradation]] — LLM 深度扩展与信号退化
- [[deterministic-agent-failures]] — Deterministic Agent Failures（确定性 Agent 失败分类）
- [[dgae]] — Difficulty-Balanced Group Advantage Estimation (DGAE)
- [[dgpo]] — Difficulty-Aware Group Policy Optimization (DGPO)
- [[diagonal-ramsey-number]] — Diagonal Ramsey Number（对角拉姆齐数）
- [[diagonalization-method]] — 对角线方法 (Diagonalization Method)
- [[differentiable-token-budgeting]] — Differentiable Token Budgeting
- [[diffusion-based-tpp]] — 扩散时间点过程 (Diffusion-based TPP)
- [[diffusion-transformer]] — Diffusion Transformer (DiT)
- [[dime-dynamic-in-database-modeling-engine]] — DIME (Dynamic In-Database Modeling Engine)
- [[discrete-diffusion-language-models]] — discrete-diffusion-language-models
- [[distractor-context]] — Distractor Context / 干扰上下文
- [[distributed-cache-routing]] — Distributed Cache Routing (分布式缓存路由)
- [[distributed-optimistic-locking]] — Distributed Optimistic Locking (分布式乐观锁)
- [[distributed-prompt-caching]] — Distributed Prompt Caching (分布式提示词缓存)
- [[distribution-shift]] — Distribution Shift（分布偏移）
- [[document-degradation]] — Document Degradation / 文档退化
- [[domain-aware-preference-optimization]] — Domain-Aware Preference Optimization
- [[domain-knowledge-reasoning]] — 领域知识推理 (Domain Knowledge Reasoning)
- [[domain-specific-evaluation]] — Domain-Specific Evaluation / 领域特定评估
- [[dominant-shuffle]] — Dominant Shuffle
- [[double-descent]] — 双下降 (Double Descent)
- [[dpo]] — DPO (Direct Preference Optimization)
- [[dpo-bias-mitigation]] — DPO Bias Mitigation
- [[dqw]] — Difficulty-Aware Question-Level Weighting (DQW)
- [[drift-detection]] — 漂移检测 (Drift Detection)
- [[drifting]] — Temporal Drift (时序漂移)
- [[dual-collapse]] — Dual Collapse in Latent CoT
- [[dual-layer-rl]] — Dual-Layer RL (双层强化学习)
- [[dual-space-rl]] — Dual Space RL (DSRL)
- [[duo-attention]] — DuoAttention
- [[dynamic-in-database-modeling]] — Dynamic In-Database Modeling
- [[dynamic-mode-decomposition]] — Dynamic Mode Decomposition (DMD)
- [[dynamic-model-fusion]] — Dynamic Model Fusion
- [[dynamic-react]] — Dynamic ReAct — 动态工具选择
- [[dynamic-relation-modeling]] — Dynamic Relation Modeling
- [[dynamic-state-evolution]] — Dynamic State Evolution
- [[dynamic-token-limit]] — 动态 Token 限制 (Dynamic Token Limit)
- [[dynamic-weight-updates]] — Dynamic Weight Updates
- [[e-values]] — E-values（证据值）
- [[edge-of-stability]] — Edge of Stability (EoS)
- [[ellipsis-prompt]] — 省略号提示 (Ellipsis Prompt)
- [[eluder-dimension]] — Eluder 维度 (Eluder Dimension)
- [[embedded-language-flows]] — Embedded Language Flows (ELF)
- [[eml-operator]] — EML 算子 (Exp-Minus-Log)
- [[emmy-noether]] — 埃米·诺特 (Emmy Noether)
- [[emotional-reasoning-bias]] — Emotional Reasoning Bias
- [[emotional-value-evaluation]] — 情绪价值评估 (Emotional Value Evaluation)
- [[empirical-discovery-simulation]] — 经验发现与模拟 (Empirical Discovery & Simulation)
- [[empirical-fisher]] — Empirical Fisher (经验 Fisher 信息)
- [[end-to-end-ocr]] — End-to-End OCR
- [[end-to-end-streaming-interaction]] — End-to-End Streaming Interaction
- [[endogenous-reasoning]] — Endogenous Reasoning（内生推理）
- [[engram]] — Engram (Conditional Memory Module)
- [[enhanced-state-space-models]] — 增强状态空间模型 (Enhanced State-Space Models)
- [[ensemble-based-rewards]] — 集成奖励 (Ensemble-Based Rewards)
- [[environment-contract-layer]] — Environment Contract Layer（环境契约层）
- [[epistemic-uncertainty]] — 认知不确定性 (Epistemic Uncertainty)
- [[epoch-based-optimistic-mle]] — Epoch-based 乐观 MLE (Epoch-based Optimistic MLE)
- [[etclovg-taxonomy]] — ETCLOVG 七层分类法
- [[evolution-probe]] — 进化探针 (Evolution Probe)
- [[evolutionary-algorithms]] — Evolutionary Algorithms (进化算法)
- [[evolving-knowledge-injection]] — 进化知识注入 (Evolving Knowledge Injection)
- [[execution-environment]] — Execution Environment（执行环境与沙箱）
- [[execution-fidelity]] — Execution Fidelity
- [[execution-harness]] — Execution Harness
- [[expected-calibration-error]] — 预期校准误差 (Expected Calibration Error, ECE)
- [[experience-distillation]] — 经验蒸馏 (Experience Distillation)
- [[experience-representation]] — 经验表示 (Experience Representation)
- [[exploratory-dynamics]] — 探索动力学 (Exploratory Dynamics)
- [[exponential-decay-reward]] — 指数衰减奖励 (Exponential Decay Reward)
- [[extended-kalman-filter]] — 扩展 Kalman 滤波
- [[fact-augmented-key-expansion]] — Fact-Augmented Key Expansion
- [[fading-memory]] — 衰减记忆 (Fading Memory)
- [[faithfulness-in-ai]] — Faithfulness in AI
- [[feature-absorption]] — 特征吸收 (Feature Absorption)
- [[feature-family]] — 特征家族 (Feature Family)
- [[feature-splitting]] — 特征分裂 (Feature Splitting)
- [[feedforward-depth-limitation]] — 前馈深度局限 (Feedforward Depth Limitation)
- [[few-shot-learning]] — Few-Shot Learning (少样本学习)
- [[fiber-of-parametrization]] — 参数化纤维 (Fiber of Parametrization)
- [[financial-agent-permission]] — 金融 Agent 权限管控
- [[financial-llm-deployment]] — 金融行业大模型部署约束
- [[financial-llm-model-selection]] — 金融大模型选型
- [[financial-llm-requirements]] — 金融行业好需求工程
- [[fine-grained-counting]] — 细粒度计数 (Fine-grained Counting)
- [[first-lyapunov-coefficient]] — First Lyapunov Coefficient (第一Lyapunov系数)
- [[fisher-information-metric]] — Fisher 信息度量 (Fisher Information Metric)
- [[fisher-lipschitz]] — Fisher-Lipschitz 假设类
- [[fisher-width]] — Fisher Width (Fisher 宽度)
- [[five-axis-positional-encoding]] — 五轴位置编码 (Five-Axis Positional Encoding)
- [[fixed-mean-gaussian-process]] — 固定均值高斯过程 (Fixed-Mean Gaussian Process)
- [[flash-attention]] — FlashAttention
- [[flash-attention-3]] — FlashAttention-3
- [[flex-attention]] — FlexAttention
- [[flip-bifurcation]] — Flip Bifurcation (翻转分岔)
- [[flow-matching]] — Flow Matching
- [[forecasting-augmentation-taxonomy]] — Forecasting Augmentation Taxonomy
- [[formal-concept-analysis]] — 形式概念分析 (Formal Concept Analysis)
- [[formal-security-model]] — 形式化安全模型
- [[formal-systems]] — 形式系统 (Formal System)
- [[formal-verification]] — Formal Verification (形式化验证)
- [[forward-authentication]] — 外部认证委托 (Forward Authentication)
- [[forward-repair-ladder]] — Forward-Repair Ladder
- [[foundation-model-frontier-bias]] — 基础模型前沿偏倚（Foundation Model Frontier Bias）
- [[fourier-filter-dynamics]] — Fourier Filter for Dynamics（Fourier Filter 动力学分解）
- [[fp4-quantization-training]] — FP4 Quantization-Aware Training
- [[freetimegs]] — FreeTimeGS
- [[freqmask-freqmix]] — FreqMask / FreqMix
- [[full-duplex-interaction]] — Full-Duplex Interaction
- [[function-space-modeling]] — 函数空间建模 (Function-Space Modeling)
- [[functional-input-neural-networks]] — 函数输入神经网络 (Functional Input Neural Network)
- [[furstenberg-correspondence]] — Furstenberg Correspondence Principle
- [[future-commit-cleanup]] — Future-Commit 清理 (Future-Commit Cleanup)
- [[gambling-gibbs]] — Gambling Gibbs
- [[gaussian-filtering]] — 高斯滤波
- [[gaussian-manifold]] — 高斯流形
- [[gaussian-process]] — 高斯过程 (Gaussian Process)
- [[gaussian-width]] — Gaussian Width (高斯宽度)
- [[gbrain-memory]] — GBrain Memory System
- [[gene-bench]] — Gene-Bench
- [[gene-evolution-protocol]] — 基因进化协议 (GEP)
- [[gene-probe]] — 基因探针 (Gene Probe)
- [[generalization-bounds]] — 泛化界 (Generalization Bounds)
- [[generalized-delta-rule]] — Generalized Delta Rule
- [[generation-verification-asymmetry]] — 生成-验证不对称性 (Generation-Verification Asymmetry)
- [[generative-general-unification]] — Generative-General-Unification (GenAI 三支柱)
- [[generative-perplexity]] — generative-perplexity
- [[generative-recommendation]] — 生成式推荐 (Generative Recommendation)
- [[generative-reconstruction-latent]] — Generative Reconstruction (Latent)
- [[genetic-programming]] — Genetic Programming (遗传编程)
- [[geometric-compression-latent]] — Geometric Compression (Latent CoT)
- [[geometric-ramsey-theory]] — Geometric Ramsey Theory（几何拉姆齐理论）
- [[georg-cantor]] — 格奥尔格·康托尔 (Georg Cantor)
- [[gflownet-fine-tuning]] — GFlowNet 微调
- [[gibbs-posterior]] — Gibbs 后验
- [[glitch-art-style]] — 故障艺术 (Glitch Art)
- [[global-combinatorial-optimization]] — Global Combinatorial Optimization (KV Cache)
- [[global-context-hash-tree]] — Global Context Hash Tree (全局上下文哈希树)
- [[godel-incompleteness-theorems]] — 哥德尔不完备定理 (Gödel's Incompleteness Theorems)
- [[godel-numbering]] — 哥德尔编码 (Gödel Numbering)
- [[goodsteins-theorem]] — 古德斯坦定理 (Goodstein's Theorem)
- [[governance-security]] — Governance & Security（治理与安全）
- [[gpt-image2]] — GPT-Image-2
- [[gradient-alignment]] — Gradient Alignment (PreRL)
- [[gram-generative-recursive-reasoning]] — GRAM（Generative Recursive reAsoning Models）
- [[granger-causality-tpp]] — Granger 因果发现 (Granger Causality in TPP)
- [[gravitino-unified-metadata]] — Gravitino 统一元数据管理
- [[greedy-context-screening]] — Greedy Context Screening（贪心上下文筛选）
- [[green-tao-theorem]] — Green-Tao Theorem
- [[group-relative-policy-optimization]] — 群体相对策略优化 (GRPO)
- [[grouped-query-attention]] — Grouped-Query Attention (GQA)
- [[grpo]] — Group Relative Policy Optimization (GRPO)
- [[gui-tool-hybrid-action-space]] — GUI-Tool Hybrid Action Space
- [[gumbel-softmax]] — Gumbel-Softmax 重参数化
- [[halftone-print-style]] — 半调印刷风格 (Halftone Print Style)
- [[hallucination-mitigation]] — Hallucination Mitigation in LLM Systems
- [[halting-problem]] — 停机问题 (Halting Problem)
- [[hard-token]] — Hard Token
- [[hardening-execution-environments]] — Hardening Execution Environments（硬化执行环境）
- [[hardware-aware-algorithm]] — Hardware-Aware Algorithm (Mamba)
- [[harness-as-action-verifier]] — Harness-as-Action-Verifier
- [[harness-as-policy]] — Harness-as-Policy (Code as Policy)
- [[harness-coupling-problem]] — Harness Coupling Problem（Harness 耦合问题）
- [[harness-engineering]] — Harness Engineering
- [[harness-evolution]] — Harness Evolution（轨迹驱动的 Harness 进化）
- [[harness-model-interaction]] — Harness × Model 交互效应
- [[harnessaudit]] — HarnessAudit
- [[hars]] — HARS（调和适应保留评分）
- [[hawkes-process]] — Hawkes 过程 (Hawkes Process)
- [[head-level-budget-allocation]] — Head-Level Budget Allocation
- [[head-structure-ssm]] — SSM 多头结构 (Head Structure for SSMs)
- [[heavily-compressed-attention]] — Heavily Compressed Attention (HCA)
- [[held-out-validation-gate]] — Held-Out Validation Gate (留出验证门)
- [[heuristic-learning]] — Heuristic Learning (启发式学习)
- [[heuristic-metric]] — Heuristic Metric (KV Cache)
- [[hidden-audit-channel]] — Hidden Audit Channel
- [[hidden-symmetries-neural]] — 隐藏对称性 (Hidden Symmetries)
- [[hierarchical-semantic-routing]] — 层次语义路由 — Hierarchical Semantic Routing
- [[hierarchy-preservation]] — Hierarchy Preservation — Structural Knowledge for Literature Ranking
- [[hilberts-program]] — 希尔伯特计划 (Hilbert's Program)
- [[hippo]] — HiPPO (High-order Polynomial Projection Operators)
- [[history-aware-routing]] — 历史感知路由 — History-Aware Routing
- [[honest-open-subset]] — Honest 开子集 (Honest Open Subset)
- [[hrpo]] — HRPO: Hybrid Reasoning Policy Optimization
- [[human-agent-trust]] — 人机信任 (Human-Agent Trust)
- [[human-centered-ai]] — Human-Centered AI (以人类为中心的 AI)
- [[human-in-the-loop]] — Human-in-the-Loop — 人机协同
- [[hybrid-attention-architecture]] — Hybrid Attention Architecture
- [[hybrid-reasoning]] — 混合推理 (Hybrid Reasoning)
- [[hybrid-reasoning-models]] — 混合推理模型 (Hybrid Reasoning Models)
- [[hybrid-recall-pipeline]] — Hybrid Recall Pipeline (BM25 + Dense)
- [[hyperagents]] — Hyperagents (超智能体)
- [[hypergraph-ramsey-number]] — Hypergraph Ramsey Number（超图拉姆齐数）
- [[hyperplane-arrangements]] — 超平面排列 (Hyperplane Arrangements)
- [[hypothesis-tree-refinement]] — Hypothesis Tree Refinement (HTR)
- [[identity-reference-resolution]] — 身份指代消解 (Identity Reference Resolution)
- [[image-generation-prompt-design]] — 图像生成 Prompt 设计
- [[implicit-processes]] — 隐式过程 (Implicit Processes)
- [[in-context-learning]] — 上下文学习 (In-Context Learning)
- [[in-context-learning-rate]] — In-Context Learning Rate
- [[in-database-analytics]] — In-Database Analytics
- [[induction-heads]] — Induction Heads
- [[inference-primitives]] — Inference Primitives (推理原语)
- [[inference-time-scaling]] — Inference-Time Scaling（推理时扩展）
- [[infinite-dimensional-manifolds]] — 无限维流形 (Infinite-Dimensional Manifolds)
- [[infinite-width-limit]] — 无限宽度极限 (Infinite-Width Limit)
- [[infinity-hierarchy]] — 无穷层级体系 (Infinity Hierarchy)
- [[information-cocoons]] — 信息茧房（Information Cocoons）
- [[information-flow-control]] — Information Flow Control
- [[information-geometry]] — 信息几何 (Information Geometry)
- [[information-leakage-vla]] — Information Leakage in VLA
- [[information-performance-binding]] — Information-Performance Binding
- [[input-superposition]] — Input Superposition
- [[insight-backpropagation]] — Insight Backpropagation
- [[intensity-free-modeling]] — Intensity-free 建模
- [[interaction-based-explanation]] — 交互基解释 (Interaction-Based Explanation)
- [[interaction-generalizability]] — 交互泛化性 (Interaction Generalizability)
- [[interaction-order]] — 交互阶数 (Interaction Order)
- [[interaction-types-sft]] — SFT 中的三类交互 (Removed, Preserved, Newly Emerged)
- [[interleaved-gui-tool-trajectory-scaling]] — Interleaved GUI-Tool Trajectory Scaling Pipeline
- [[internal-ticks]] — Internal Ticks
- [[internal-world-model]] — Internal World Model
- [[intersectional-persona-evaluation]] — Intersectional Persona Evaluation
- [[intervention-multiplier]] — Intervention Multiplier
- [[intra-head-eviction]] — Intra-Head Eviction
- [[intrabench]] — IntraBench — Benchmark for Content-Grounded Literature QA
- [[intragent]] — IntrAgent — Structural-Aware Literature Reading Agent
- [[intraview]] — IntraView — Content-Grounded Literature Information Retrieval
- [[intrinsic-rewards-sharpening]] — 内在奖励锐化机制 (Intrinsic Rewards Sharpening)
- [[inward-only-gradient-flow]] — Inward-Only Gradient Flow (内向梯度流)
- [[isolation-necessity-theorem]] — Isolation Necessity Theorem (隔离必要性定理)
- [[isotonic-regression]] — Isotonic Regression
- [[itemic-text-alignment]] — Itemic-Text 对齐 (Itemic-Text Alignment)
- [[itemic-tokens]] — Itemic Token
- [[iterative-capability-extension]] — 迭代能力扩展 — Iterative Capability Extension
- [[iterative-code-refinement]] — Iterative Code Refinement (迭代代码精炼)
- [[iterative-reading]] — Iterative Reading — Progressive Information Extraction from Literature
- [[ito-calculus]] — Itô 微积分 (Itô Calculus)
- [[jagged-frontier]] — Jagged Frontier / 锯齿前沿
- [[jepa]] — JEPA (Joint Embedding Predictive Architecture)
- [[jepa-for-robotics]] — JEPA for Robotics
- [[k-pass-training]] — K-Pass Training (K 遍训练)
- [[kalman-filter]] — Kalman 滤波
- [[keydiff]] — KeyDiff
- [[kl-order]] — KL 阶 (KL Order)
- [[klein-blue]] — 克莱因蓝 (Klein Blue / IKB)
- [[knowledge-adaptation]] — 知识适应 (Knowledge Adaptation)
- [[knowledge-agnostic-augmentation]] — 知识无关增强 (Knowledge-Agnostic Augmentation)
- [[knowledge-aware-augmentation]] — 知识感知增强 (Knowledge-Aware Augmentation)
- [[knowledge-bank]] — Knowledge Bank — AI 辅助开发时代的知识管理系统
- [[knowledge-injection]] — 知识注入 (Knowledge Injection)
- [[knowledge-internalization]] — 知识内化 (Knowledge Internalization)
- [[knowledge-retention]] — 知识保留 (Knowledge Retention)
- [[knowledge-tree]] — 知识树 (Knowledge Tree)
- [[kolmogorov-complexity]] — 柯尔莫哥洛夫复杂度 (Kolmogorov Complexity)
- [[koopman-autoencoder]] — Koopman Autoencoder (KAE)
- [[koopman-predictor]] — Koopman Predictor（Koopman 预测器）
- [[koopman-theory]] — Koopman Theory（Koopman 理论）
- [[kore-augmentation]] — KORE-AUGMENTATION（知识导向增强）
- [[kore-constraint]] — KORE-CONSTRAINT（知识导向约束）
- [[kv-cache]] — KV Cache
- [[kv-cache-bottleneck]] — KV 缓存内存瓶颈
- [[kv-cache-eviction]] — KV Cache Eviction
- [[kvcache-transfer]] — KVCache 传输与优化
- [[language-gradient]] — Language Gradient (语言梯度)
- [[language-loss]] — Language Loss (语言损失)
- [[large-reasoning-models]] — 大推理模型 (Large Reasoning Models)
- [[latent-action-pretraining]] — Latent-Action Pretraining
- [[latent-reasoning]] — 潜在推理 (Latent Reasoning)
- [[latent-score-mdp]] — 潜在得分 MDP (Latent-Score MDP)
- [[latent-thought-models]] — 隐式思考模型 (Latent Thought Models)
- [[latent-variable-generative-model]] — Latent-Variable Generative Model（潜在变量生成模型）
- [[latent-world-model]] — Latent World Model (Robotics)
- [[layered-memory-architecture]] — 三层记忆架构
- [[leakage-free-state-prediction]] — Leakage-Free State Prediction
- [[length-extrapolation]] — 长度外推 (Length Extrapolation)
- [[leopold-kronecker]] — 利奥波德·克罗内克尔 (Leopold Kronecker)
- [[leworldmodel]] — LeWorldModel
- [[lifecycle-aware-harness]] — Lifecycle-Aware Harness（生命周期感知 Harness）
- [[lifecycle-orchestration]] — Lifecycle & Orchestration（生命周期与编排）
- [[lifting-identity]] — Lifting Identity (提升恒等式)
- [[light-routing-agent]] — 轻量路由 Agent — Light Routing Agent
- [[linear-attention]] — 线性注意力 (Linear Attention)
- [[linear-attention-methods]] — 线性注意力方法 (Linear Attention Methods)
- [[linear-quadratic-regulator]] — 线性二次调节器 (Linear Quadratic Regulator)
- [[linear-representation-hypothesis]] — Linear Representation Hypothesis
- [[linearized-neural-network]] — 线性化神经网络 (Linearized Neural Network)
- [[llama-factory]] — LLaMA-Factory
- [[llm-applications]] — LLM 应用
- [[llm-based-temporal-point-process]] — LLM 时间点过程 (LLM-based TPP)
- [[llm-consistent-reasoning]] — LLM Consistent Reasoning
- [[llm-evaluation-benchmarks]] — LLM 评测基准体系
- [[llm-mcmc]] — LLM-MCMC
- [[logfire]] — Logfire
- [[logical-model-interaction]] — 交互逻辑模型 (Logical Model of Interactions)
- [[long-context-understanding]] — 长上下文理解 (Long-Context Understanding)
- [[long-horizon-evaluation]] — Long-Horizon Evaluation / 长视界评估
- [[long-horizon-parsing]] — Long-Horizon Parsing
- [[long-horizon-utility]] — Long-Horizon Utility
- [[long-range-dependency]] — Long-Range Dependency
- [[long-term-interactive-memory]] — Long-Term Interactive Memory
- [[longmem-eval]] — LongMemEval Benchmark
- [[look-ahead-buffer-controller]] — Look-Ahead Buffer Controller
- [[lora]] — LoRA (Low-Rank Adaptation)
- [[lost-in-the-middle]] — Lost in the Middle
- [[lovasz-local-lemma]] — Lovász Local Lemma
- [[lucas-penrose-argument]] — 卢卡斯-彭罗斯论证 (Lucas-Penrose Argument)
- [[lukv]] — LU-KV (Long-horizon Utility KV)
- [[macro-level-token-economics]] — Macro-Level Token Economics
- [[mamba-2]] — Mamba-2
- [[mamba-ssm]] — Mamba (State Space Model)
- [[manifold-constrained-hyper-connections]] — Manifold-Constrained Hyper-Connections (mHC)
- [[manifold-of-minimizers]] — Manifold of Minimizers (极小值流形)
- [[marginal-utility]] — Marginal Utility (KV Cache)
- [[marked-temporal-point-process]] — 标记时间点过程 (Marked TPP)
- [[martingale-clt]] — 鞅中心极限定理 (Martingale CLT)
- [[math-question-reformulation]] — 数学问题多维度改写
- [[mathchatsync-reasoning]] — MathChatSync Reasoning
- [[mathematical-pluralism]] — 数学多元主义 (Mathematical Pluralism)
- [[mathematical-priority-disputes]] — 数学优先权争议
- [[mathforge]] — MathForge 框架
- [[maze-navigation]] — 迷宫导航 (Maze Navigation)
- [[mc-dropout]] — MC Dropout (Monte Carlo Dropout)
- [[mcp-protocol]] — MCP 协议 — Model Context Protocol
- [[mcp-tools-dataset]] — MCP-tools 数据集
- [[me2-principle]] — ME² Principle
- [[mechanistic-interpretability]] — 机制可解释性 (Mechanistic Interpretability)
- [[megatron-lm]] — Megatron-LM
- [[mem2skill]] — Mem2Skill — 记忆到技能转化
- [[memcube]] — MemCube — 最小记忆单元
- [[memory-caching-rnn]] — Memory Caching (MC)
- [[memory-compute-decoupling]] — Memory-Compute Decoupling
- [[memory-consolidation]] — Memory Consolidation（写后提炼）
- [[memory-dedup-pipeline]] — 记忆去重管线
- [[memory-governance]] — 记忆治理 — Memory Governance
- [[memory-indexing-retrieval-reading]] — Memory Indexing-Retrieval-Reading Framework
- [[meso-level-token-economics]] — Meso-Level Token Economics
- [[messy-context-reasoning]] — 混乱上下文推理 (Messy Context Reasoning)
- [[meta-jctrader]] — Meta-JCTrader
- [[meta-learning]] — Meta-Learning (元学习)
- [[meta-tools]] — Meta Tools — 管理工具的工具
- [[metacognitive-self-modification]] — Metacognitive Self-Modification (元认知自我修改)
- [[metamathematics]] — 元数学 (Metamathematics)
- [[micro-level-token-economics]] — Micro-Level Token Economics
- [[million-token-context]] — Million-Token Context
- [[mineru]] — minerU — PDF-to-Markdown for Scientific Literature
- [[minimax-optimality]] — Minimax 最优性 (Minimax Optimality)
- [[mixture-of-attention-schemes]] — Mixture of Attention Schemes (MoAS)
- [[mixture-of-depths-attention]] — Mixture-of-Depths Attention (MoDA)
- [[mixture-of-experts]] — Mixture of Experts (MoE)
- [[ml-technical-debt]] — ML 技术债务
- [[mme-voke]] — MMEVOKE
- [[model-collapse-step]] — 模型崩溃步 (Model Collapse Step, MCS)
- [[model-driven-vs-app-driven-memory]] — 模型驱动 vs 应用驱动记忆
- [[model-free-rl]] — Model-Free 强化学习 (Model-Free RL)
- [[model-harness-relationship]] — Model-Harness Relationship (模型与Harness关系)
- [[model-steering]] — Model Steering
- [[moe-lora]] — MoELoRA
- [[moe-lora-toolchain-conflict]] — MOE + LoRA 工具链冲突
- [[moment-matching-filter]] — 矩匹配滤波
- [[monocular-video-to-4d]] — 单目视频到 4D (Monocular Video to 4D)
- [[mqr]] — Multi-Aspect Question Reformulation (MQR)
- [[mrq-algorithm]] — MR.Q 算法 (MR.Q Algorithm)
- [[multi-agent-orchestration]] — Multi-Agent Orchestration（多 Agent 编排）
- [[multi-agent-safety]] — Multi-Agent Safety
- [[multi-agent-spiral]] — 多智能体螺旋（Multi-Agent Spiral）
- [[multi-dimensional-synthetic-data]] — 多维合成数据 (Multi-Dimensional Synthetic Data)
- [[multi-head-attention]] — Multi-Head Attention (MHA)
- [[multi-head-latent-attention]] — Multi-head Latent Attention (MLA)
- [[multi-hot-cross-entropy]] — Multi-hot Cross-Entropy (MCE)
- [[multi-query-attention]] — Multi-Query Attention (MQA)
- [[multi-solution-recovery]] — Multi-Solution Recovery（多解恢复）
- [[multi-step-planning]] — 多步规划 (Multi-Step Planning)
- [[multi-teacher-on-policy-distillation]] — Multi-Teacher On-Policy Distillation (MODPO)
- [[multi-token-prediction]] — Multi-Token Prediction (MTP)
- [[multi-trajectory-inference]] — Multi-Trajectory Inference（多轨迹推理）
- [[multi-turn-reasoning]] — Multi-Turn Reasoning Training (多轮推理训练)
- [[multi-view-captioning]] — 多视角字幕 (Multi-View Captioning)
- [[multimodal-large-language-model]] — 多模态大语言模型 (MLLM)
- [[multimodal-rag]] — 多模态 RAG (Multimodal RAG)
- [[multitask-rl]] — 多任务强化学习 (Multitask RL)
- [[muon-optimizer]] — Muon Optimizer
- [[nachbin-theorem]] — Nachbin 定理
- [[native-sparse-attention]] — Native Sparse Attention (NSA)
- [[native-streaming-ar-training]] — Native Streaming AR Training
- [[natural-gradient-descent]] — 自然梯度下降
- [[negative-sample-reinforcement]] — Negative Sample Reinforcement (NSR)
- [[neural-synchronization]] — Neural Synchronization as Representation
- [[neural-tangent-kernel]] — 神经正切核 (Neural Tangent Kernel)
- [[neural-temporal-point-process]] — 神经时间点过程 (Neural TPP)
- [[neurida]] — NeurIDA
- [[neuroalgebraic-geometry]] — 神经代数几何 (Neuroalgebraic Geometry)
- [[neuromanifold]] — 神经流形 (Neuromanifold)
- [[neuron-level-models]] — Neuron-Level Models (NLMs)
- [[neuron-pairing]] — Neuron Pairing
- [[neuroscience]] — Neuroscience (神经科学)
- [[next-state-grounding]] — Next-State Grounding
- [[ngram-embedding]] — N-gram Embedding (in LLMs)
- [[non-anticipative-functionals]] — 非预期泛函 (Non-Anticipative Functionals)
- [[non-stationary-time-series]] — Non-stationary Time Series（非平稳时间序列）
- [[non-thinking-mode]] — 非思考模式 (Non-Thinking Mode)
- [[normal-tangent-decomposition]] — Normal-Tangent Decomposition (法向-切向分解)
- [[ntk-aware-interpolation]] — NTK-aware 位置编码插值
- [[null-space]] — 零空间 (Null Space)
- [[null-space-projection-knowledge]] — 零空间投影知识保留 (Null Space Projection for Knowledge Retention)
- [[objective-driven-ai]] — 目标驱动AI (Objective-Driven AI)
- [[objective-interference-collapse]] — Objective Interference Collapse (目标干扰坍缩)
- [[observability]] — Observability & Operations（可观测性与运维）
- [[observable-operator-model]] — 可观测算子模型 (Observable Operator Model, OOM)
- [[off-policy-llm-post-training]] — Off-Policy LLM 后训练
- [[offline-profiling]] — Offline Profiling (LU-KV)
- [[omnidocbench]] — OmniDocBench
- [[on-policy-distillation]] — On-Policy Distillation (OPD)
- [[on-policy-learning-collapse]] — On-policy Learning Collapse
- [[one-pass-fine-tuning]] — One-Pass Fine-Tuning (单遍微调)
- [[onereason-bench]] — OneReason-Bench
- [[onerec]] — OneRec 生成式推荐模型族
- [[open-telemetry]] — OpenTelemetry (OTel)
- [[openclaw]] — OpenClaw
- [[opinion-polarization]] — 观点极化（Opinion Polarization）
- [[optimal-gui-tool-path-selection]] — Optimal GUI-Tool Path Selection
- [[optimality-gap]] — Optimality Gap
- [[oracle-importance]] — Oracle Importance
- [[order-bias-removal]] — Order Bias Removal
- [[osworld-mcp]] — OSWorld-MCP Benchmark
- [[output-aware-metric]] — Output-Aware Metric (OAM)
- [[overthinking]] — 过度思考 (Overthinking)
- [[pac-bayesian-bounds]] — PAC-Bayesian 泛化界 (PAC-Bayesian Bounds)
- [[pageindex]] — PageIndex
- [[paley-graph]] — Paley Graph
- [[parallel-scan]] — Parallel Scan (Parallel Associative Scan)
- [[parametrization-map]] — 参数化映射 (Parametrization Map)
- [[pareto-frontier-evaluation]] — Pareto 前沿评测 (Pareto Frontier Evaluation)
- [[paris-harrington-theorem]] — Paris-Harrington Theorem（巴黎-哈灵顿定理）
- [[partially-observable-markov-game]] — 部分可观测马尔可夫博弈 (Partially Observable Markov Game, POMG)
- [[pass-at-k-vs-pass-k]] — Pass@k vs Pass^k（能力上限 vs 可靠性下限）
- [[passive-vs-active-knowledge]] — 被动知识 vs 主动知识
- [[patch-based-evaluation]] — Patch-Based Evaluation (基于 Patch 的评测合约)
- [[path-tracing]] — 路径追踪 (Path Tracing)
- [[pdf-processing]] — PDF Processing
- [[peano-arithmetic]] — 皮亚诺算术 (Peano Arithmetic, PA)
- [[per-index-time-decay]] — Per-Index Time Decay
- [[perception-cognition-recommendation]] — 感知-认知推荐层次 (R0-R3)
- [[perception-gap]] — 感知鸿沟 (Perception Gap)
- [[persona-invariant-reasoning]] — Persona-Invariant Reasoning
- [[personalization-trap]] — 个性化陷阱 (Personalization Trap)
- [[pldm]] — PLDM (Pretrained Latent Dynamics Model)
- [[poisson-process]] — 泊松过程 (Poisson Process)
- [[policy-constrained-execution]] — Policy-Constrained Execution
- [[policy-regret]] — 策略后悔 (Policy Regret)
- [[policy-reincarnation]] — Policy Reincarnation
- [[polysemanticity]] — 多义性与单义性 (Polysemanticity & Monosemanticity)
- [[pomdp]] — 部分可观测马尔可夫决策过程 (POMDP)
- [[position-encoding]] — Position Encoding (位置编码)
- [[position-id-discrepancy]] — Position ID Discrepancy (位置 ID 偏差)
- [[positive-sample-reinforcement]] — Positive Sample Reinforcement (PSR)
- [[post-action-configuration]] — 后动作配置 (Post-Action Configuration)
- [[post-hoc-reasoning-rl]] — 后置推理 RL (Post-Hoc Reasoning RL)
- [[post-train-space-rl]] — Post-train Space Reinforcement Learning
- [[posterior-linearization-filter]] — 后验线性化滤波
- [[posterior-lipschitz-adversary]] — 后验李普希茨对手 (Posterior-Lipschitz Adversary)
- [[practitioner-research-gap]] — Practitioner-Research Gap（从业者-研究鸿沟）
- [[pre-activation-history]] — Pre-Activation History
- [[pre-hoc-reasoning-rl]] — 前置推理 RL (Pre-Hoc Reasoning RL)
- [[pre-train-space-reinforcement-learning]] — Pre-train Space Reinforcement Learning (PreRL)
- [[precision-weighted-fusion]] — 精度加权融合 (Precision-Weighted Fusion)
- [[prediction-driven-inference]] — 预测驱动推断（Prediction-Driven Inference）
- [[predictive-representation-learning]] — 预测表征学习 (Predictive Representation Learning)
- [[preference-log-odds]] — Preference Log-Odds
- [[preference-utility-analysis]] — Preference–Utility Analysis
- [[prefill-as-a-service]] — Prefill-as-a-Service (PrfaaS)
- [[prefill-decode-disaggregation]] — Prefill-Decode 分离架构 (PD Disaggregation)
- [[prefix-matching]] — Prefix Matching（前缀匹配）
- [[preserved-interactions-backbone]] — 保留交互作为推理支柱 (Preserved Interactions as Inference Backbone)
- [[pretraining-statistical-bias]] — 预训练统计偏好（Pretraining Statistical Bias）
- [[primitive-completeness]] — Primitive Completeness (原语完备性)
- [[primitive-recursive-functions]] — 原始递归函数 (Primitive Recursive Functions)
- [[probabilistic-method]] — Probabilistic Method（概率方法）
- [[probability-matching]] — 概率匹配（Probability Matching）
- [[procedural-gap]] — 过程性鸿沟 — Procedural Gap
- [[procedural-skill]] — 过程技能 (Procedural Skill)
- [[procedural-skill-layer]] — Procedural Skill Layer（程序技能层）
- [[procedural-task-execution]] — 程序性任务执行 (Procedural Task Execution)
- [[product-stability]] — Product-Stability (乘积稳定性)
- [[program-synthesis]] — Program Synthesis (程序合成)
- [[prompt-caching]] — Prompt Caching
- [[prompt-engineering-vs-fine-tuning]] — 提示词工程 vs 微调
- [[prompt-layering]] — Prompt Layering（提示分层）
- [[prompt-reverse-engineering]] — 图片反推 Prompt (Prompt Reverse Engineering)
- [[prompt-to-harness-evolution]] — Prompt-to-Harness Evolution（三阶段工程演进）
- [[prope]] — PRoPE (Projective Rotary Position Encoding)
- [[prospective-memory-index]] — Prospective Memory Index (前瞻记忆索引)
- [[pseudo-huber-loss]] — Pseudo-Huber 损失
- [[pydantic]] — Pydantic
- [[pydantic-ai]] — Pydantic AI
- [[pydantic-core]] — pydantic-core
- [[pyramidkv]] — PyramidKV
- [[qlora]] — QLoRA (量化低秩适配)
- [[quadrotor-trajectory-following]] — 四旋翼轨迹跟踪 (Quadrotor Trajectory Following)
- [[query-intent-analyzer]] — Query Intent Analyzer
- [[question-quality-vs-quantity]] — Question Quality vs. Quantity（问题质量 vs 数量）
- [[queueing-network-control]] — 排队网络控制 (Queueing Network Control)
- [[rademacher-complexity]] — Rademacher Complexity
- [[rag]] — RAG (检索增强生成)
- [[rag-closed-loop]] — RAG 闭环迭代（RAG Closed-Loop Iteration）
- [[rag-systems]] — RAG 系统
- [[ramsey-context-cache]] — Ramsey Context Cache（拉姆齐上下文缓存）
- [[ramsey-context-graph]] — Ramsey Context Graph（拉姆齐上下文图）
- [[ramsey-context-template]] — Ramsey Context Template（拉姆齐上下文模板）
- [[ramsey-numbers]] — Ramsey Numbers（拉姆齐数）
- [[ramsey-theory]] — Ramsey Theory（拉姆齐理论）
- [[ramsey-theory-applications]] — Ramsey Theory Applications（拉姆齐理论应用）
- [[random-access-binding]] — Random-Access Binding (随机访问绑定)
- [[random-graph-theory]] — Random Graph Theory（随机图理论）
- [[real-life-context-learning]] — 真实生活上下文学习 (Real-Life Context Learning)
- [[real-log-canonical-threshold]] — 实对数典范阈值 (Real Log Canonical Threshold, RLCT)
- [[reasoning-quality-optimization]] — Reasoning Quality Optimization
- [[recommendation-cot]] — 推荐思维链 (Recommendation CoT)
- [[recommendation-reasoning]] — 推荐推理 (Recommendation Reasoning)
- [[rectified-flows]] — Rectified Flows
- [[recurrence-taxonomy]] — 循环分类法 (Recurrence Taxonomy)
- [[recurrent-transformer-architectures]] — 循环Transformer架构 (Recurrent Transformer Architectures)
- [[recursive-reasoning-models]] — Recursive Reasoning Models（递归推理模型）
- [[recursive-self-improvement]] — Recursive Self-Improvement (递归自我改进)
- [[reer-reverse-knowledge-extraction]] — REER 逆向知识提炼
- [[reference-gap]] — 引用鸿沟 (Reference Gap)
- [[reference-sliding-window-attention]] — Reference Sliding Window Attention (R-SWA)
- [[regular-language-recognition]] — Regular Language Recognition
- [[reinforced-online-policy-distillation]] — Reinforced Online-Policy Distillation (ROPD)
- [[reinforcement-learning]] — 强化学习 (Reinforcement Learning)
- [[reinforcement-learning-trading]] — Reinforcement Learning Trading（强化学习交易）
- [[rejected-edit-buffer]] — Rejected-Edit Buffer (拒绝编辑缓冲)
- [[rejection-sampling-fine-tuning]] — Rejection Sampling Fine-tuning (RSFT)
- [[relational-graph]] — Relational Graph
- [[reliable-state-long-running-agents]] — Reliable State in Long-Running Agents（长期运行中的可靠状态）
- [[rep-mt-sac]] — RepMT-SAC
- [[reparameterization-exploration]] — 重参数化探索 (Reparameterization Exploration)
- [[replay-buffer-rl-llm]] — Replay Buffer 在 LLM RL 中的应用
- [[representation-alignment]] — Representation Alignment
- [[representation-collapse]] — 表征坍缩 (Representation Collapse)
- [[representation-learning-rl]] — RL中的表征学习 (Representation Learning in RL)
- [[representation-space]] — Representation Space
- [[representation-validity]] — Representation Validity
- [[representational-alignment]] — 表征对齐 (Representational Alignment)
- [[research-hypothesis-tree]] — Research Hypothesis Tree
- [[resource-access-control]] — Resource Access Control
- [[reverse-proxy-authentication]] — 反向代理认证 (Reverse Proxy Authentication)
- [[reward-hacking]] — Reward Hacking（奖励黑客）
- [[reward-hacking-llm]] — LLM 奖励黑客 (Reward Hacking in LLMs)
- [[reward-model]] — 奖励模型 (Reward Model, RM)
- [[reward-recency-sampling]] — 奖励-最近度混合采样
- [[richard-dedekind]] — 里夏德·狄德金 (Richard Dedekind)
- [[risograph-print-style]] — Riso 印刷风格 (Risograph Print Style)
- [[rlhf]] — RLHF (Reinforcement Learning from Human Feedback)
- [[rlhf-alignment-amplification]] — RLHF 对齐放大（RLHF Alignment Amplification）
- [[rlvr-unified-framework]] — RLVR 统一理论框架
- [[role-setting-entrenchment]] — 角色设定固化（Role-Setting Entrenchment）
- [[rolling-kv-cache]] — 滚动 KV 缓存 (Rolling KV Cache)
- [[rollout-drift]] — Rollout Drift (推演漂移)
- [[rotary-position-embedding]] — 旋转位置编码 (RoPE)
- [[rough-path-theory]] — 粗糙路径理论 (Rough Path Theory)
- [[round-trip-reconstruction-score]] — Round-Trip Reconstruction Score (RS@k)
- [[rule-system-application]] — 规则系统应用 (Rule System Application)
- [[runtime-governance]] — 运行时治理 — Skill Governance
- [[runtime-harness-adaptation]] — Runtime Harness Adaptation（运行时骨架适配）
- [[runtime-interface-adaptation]] — Runtime Interface Adaptation（运行时接口适配）
- [[russells-paradox]] — 罗素悖论 (Russell's Paradox)
- [[russian-constructivism]] — 俄国构成主义 (Russian Constructivism)
- [[rwkv]] — RWKV (Receptance Weighted Key Value)
- [[s-token]] — S-Token (Superposed Token)
- [[safety-adherence-rate]] — Safety Adherence Rate
- [[scaling-permutation-symmetry]] — 缩放与置换对称性 (Scaling & Permutation Symmetries)
- [[scientific-literature-qa]] — Scientific Literature QA — Question Answering over Research Papers
- [[sde-sampler-language]] — SDE Sampler for Language Diffusion
- [[se3-relative-camera-encoding]] — SE(3) 相对相机编码
- [[search-and-load]] — Search and Load — 精选工具加载
- [[searcher-trainer-decoupling]] — Searcher-Trainer 解耦架构
- [[section-ranking]] — Section Ranking — Structure-Aware Literature Section Prioritization
- [[secure-containers]] — 安全容器
- [[seer-attention]] — SeerAttention
- [[selective-copy]] — Selective Copying
- [[selective-hitl]] — 选择性 HITL — Selective Human-in-the-Loop
- [[selective-state-space]] — Selective State Space (S6)
- [[selective-state-space-models]] — 选择性状态空间模型 (Selective State Space Models)
- [[self-conditioning]] — Self-Conditioning
- [[self-evolutionary-mutation]] — 自进化变异 — Self-Evolutionary Mutation
- [[self-evolving-agents]] — Self-Evolving Agents (自进化 Agent)
- [[self-evolving-benchmark]] — 自进化基准 (Self-Evolving Benchmark)
- [[self-improving-ai]] — Self-Improving AI (自我改进人工智能)
- [[self-reference]] — 自指 (Self-Reference)
- [[self-resampling]] — Self-Resampling
- [[self-verification-rewards]] — 自我验证奖励 (Self-Verification Rewards)
- [[semantic-equivalence]] — Semantic Equivalence / 语义等价
- [[semi-algebraic-set]] — 半代数集 (Semi-algebraic Set)
- [[semiseparable-matrices]] — 半可分矩阵 (Semiseparable Matrices)
- [[sequence-packing]] — Sequence Packing (序列打包)
- [[sequential-dependency]] — 顺序依赖 (Sequential Dependency)
- [[set-theory-history]] — 集合论史
- [[sft-denoising-stage]] — SFT 去噪阶段 (SFT Denoising Stage)
- [[sft-early-stopping]] — SFT 早停策略 (SFT Early Stopping)
- [[sglang]] — SGLang
- [[shadow-calling]] — Shadow Calling (影子调用)
- [[shapley-values]] — Shapley 值 (Shapley Values)
- [[shared-parameter-influence]] — Shared Parameter Influence
- [[shared-weight-discretization]] — Shared-Weight Discretization
- [[sharpness]] — Sharpness (锐度)
- [[signature]] — 签名 (Signature of Paths)
- [[sigreg]] — SIGReg (Sketch Isotropic Gaussian Regularization)
- [[singular-learning-theory]] — 奇异学习理论 (Singular Learning Theory)
- [[singularity]] — Singularity (奇点)
- [[sink-token]] — 汇 Token (Sink Token)
- [[situational-test-emotional-understanding]] — Situational Test of Emotional Understanding (STEU)
- [[skill-acquisition]] — Skill 获取 — 四种路径
- [[skill-as-external-state]] — Skill as External State (Skill 作为外部状态)
- [[skill-composition]] — Skill 组合 — 多技能编排
- [[skill-data-flywheel]] — Skill Data Flywheel (Skill 数据飞轮)
- [[skill-ecosystem]] — Skill Ecosystem (Skill 生态系统)
- [[skill-evolution]] — Skill 演化 — 修订→验证→治理
- [[skill-lifecycle]] — Skill 生命周期
- [[skill-probe]] — 技能探针 (Skill Probe)
- [[skill-representation]] — Skill 表示 — 文本/代码/混合
- [[skill-retrieval]] — Skill 检索 — 稠密/稀疏/生成/结构
- [[skill-selection]] — Skill 选择 — 上下文/组合/效用/反馈
- [[skillopt]] — SkillOpt
- [[slow-meta-update]] — Slow/Meta Update (慢/元更新)
- [[snapkv]] — SnapKV
- [[social-capital-framework]] — Social Capital Framework (AI Bias)
- [[social-video]] — Social Video
- [[social-world-model]] — Social World Model
- [[socialvideo-bench]] — SocialVideo Bench
- [[soft-actor-critic]] — Soft Actor-Critic (SAC)
- [[soft-supersession]] — Soft-Supersession
- [[soft-token]] — Soft Token
- [[softmax-off-by-one]] — SoftMax-off-by-One
- [[sovereign-ai]] — 主权AI (Sovereign AI)
- [[space-supervision]] — Space Supervision
- [[sparse-attention-patterns]] — 稀疏注意力模式 (Sparse Attention Patterns)
- [[sparse-autoencoder]] — 稀疏自编码器 (Sparse Autoencoder)
- [[sparsity-allocation]] — Sparsity Allocation (U-shaped Law)
- [[specialist-training-pipeline]] — Specialist Training Pipeline
- [[specialize-then-unify-rl]] — Specialize-then-Unify RL
- [[specialized-rl]] — 专项强化学习 (Specialized RL)
- [[specialized-sft]] — 专项监督微调 (Specialized SFT)
- [[spectral-mdp-decomposition]] — 谱 MDP 分解 (Spectral MDP Decomposition)
- [[spiking-neural-networks]] — Spiking Neural Networks (SNN)
- [[spiral-of-silence]] — 沉默的螺旋（Spiral of Silence）
- [[split-steering]] — SPLIT Steering
- [[spurious-predictability]] — Spurious Predictability
- [[ssd-algorithm]] — SSD 算法 (Structured State Space Duality Algorithm)
- [[stage-matched-data-config]] — Stage-Matched Data Configuration (分阶段数据配置)
- [[standard-agent-handoffs]] — Standard Agent Handoffs（标准化 Agent 交接）
- [[state-dependent-feasible-action-sets]] — 状态依赖可行动作集 (State-Dependent Feasible Action Sets)
- [[state-space-models]] — 状态空间模型 (State-Space Models)
- [[state-tracking]] — 状态追踪 (State Tracking)
- [[statistical-contract-theory]] — 统计合同理论（Statistical Contract Theory）
- [[statistical-manifold]] — Statistical Manifold (统计流形)
- [[staug]] — STAug (EMD-based Augmentation)
- [[steering-dynamics]] — Steering Dynamics
- [[steering-vector]] — Steering Vector
- [[stein-lemma]] — Stein 引理
- [[stem-sparse-attention]] — Stem Sparse Attention
- [[step-recurrence]] — 步级循环 (Step Recurrence)
- [[stochastic-differential-equation]] — 随机微分方程 (Stochastic Differential Equation)
- [[stochastic-latent-trajectory]] — Stochastic Latent Trajectory（随机潜在轨迹）
- [[strategy-engineering-unification]] — Strategy-Engineering Unification (策略与工程统一)
- [[strategy-gene]] — 策略基因 (Strategy Gene)
- [[streaming-generation]] — Streaming Generation
- [[streaming-inference]] — Streaming Inference
- [[structured-knowledge]] — 结构化知识 (Structured Knowledge)
- [[structured-masked-attention]] — 结构化掩码注意力 (Structured Masked Attention)
- [[structured-output]] — 结构化输出 (Structured Output)
- [[structured-state-space-duality]] — 结构化状态空间对偶 (Structured State Space Duality)
- [[structured-state-space-models]] — Structured State Space Models (S4)
- [[stub-pattern]] — Stub Pattern（轻量化桩模式）
- [[subquadratic-transformer-alternatives]] — 次二次 Transformer 替代方案
- [[sufficiency-check]] — Sufficiency Check — Explicit Hallucination Gate in Literature QA
- [[sufficient-context-paradox]] — 充分上下文悖论 (Sufficient Context Paradox)
- [[superposition]] — 叠加 (Superposition)
- [[supervised-fine-tuning]] — 监督微调 (Supervised Fine-Tuning, SFT)
- [[swe-bench]] — SWE-bench
- [[symbolic-backpropagation]] — Symbolic Back-Propagation (符号反向传播)
- [[symbolic-network]] — Symbolic Network (符号网络)
- [[symbolic-regression]] — Symbolic Regression
- [[synapse-model]] — Synapse Model
- [[synthetic-data]] — 合成数据 (Synthetic Data)
- [[synthetic-data-qa-generation]] — Synthetic Data QA Generation (合成数据Q&A生成)
- [[system-2-thinking]] — System 2 思维
- [[system-message-abuse]] — System Message Abuse（系统消息滥用）
- [[system-stability]] — System Stability
- [[szemerédi-regularity-lemma]] — Szemerédi Regularity Lemma
- [[tabular-foundation-models]] — Tabular Foundation Models
- [[tapestry-federated]] — Tapestry 联邦训练
- [[task-conditioned-policy]] — 任务条件策略 (Task-Conditioned Policy)
- [[task-distribution]] — 任务分布 (Task Distribution)
- [[task-invariant-representation]] — 任务不变表征 (Task-Invariant Representation)
- [[taylor-expansion-q-function]] — Q 函数 Taylor 展开 (Taylor Expansion of Q-Function)
- [[tba]] — Trajectory Balance with Asynchrony (TBA)
- [[teacher-forced-history]] — 教师强制历史 (Teacher-Forced History)
- [[temperature-sampling]] — 温度采样（Temperature Sampling）
- [[temporal-decay-neural]] — Temporal Decay (Neural)
- [[temporal-patch-shuffle]] — Temporal Patch Shuffle (TPS)
- [[temporal-point-process]] — 时间点过程 (Temporal Point Process)
- [[temporal-rollout]] — 时间滚动展开 (Temporal Rollout)
- [[tensor-contraction-duality]] — 张量收缩对偶 (Tensor Contraction Duality)
- [[terminal-bench]] — Terminal-Bench
- [[test-time-control]] — 测试时控制 (Test-Time Control)
- [[test-time-scaling]] — Test-Time Scaling
- [[test-time-training-rl]] — 测试时训练 RL (Test-Time Training with RL)
- [[text-space-optimizer]] — Text-Space Optimizer (文本空间优化器)
- [[text-vs-weight-optimization]] — Text vs Weight Optimization (文本 vs 权重优化)
- [[textual-learning-rate]] — Textual Learning Rate (文本学习率)
- [[thinker-performer-pipeline]] — Thinker-Performer Pipeline
- [[thinking-based-non-thinking]] — TNT: 基于思考的非思考 (Thinking-Based Non-Thinking)
- [[thinking-mode]] — 思考模式 (Thinking Mode)
- [[thinking-reward-model]] — Thinking Reward Model (TRM)
- [[thinking-supervision-transfer]] — Thinking Supervision Transfer
- [[thompson-sampling-code-search]] — Thompson Sampling Code Search
- [[three-engineering-phases]] — Three Engineering Phases（三阶段工程演进）
- [[three-stage-curriculum-training]] — 三阶段课程训练 (Three-Stage Curriculum Training)
- [[throughput-hypothesis]] — Throughput Hypothesis (吞吐量假说)
- [[time-aware-query-expansion]] — Time-Aware Query Expansion
- [[time-series-forecasting-augmentation]] — Time Series Forecasting Augmentation
- [[time-variant-dynamics]] — Time-variant Dynamics（时变动力学）
- [[token-as-economic-primitive]] — Token as Economic Primitive
- [[token-duplication]] — Token Duplication (Token 复制)
- [[token-economics]] — Token Economics
- [[token-efficiency]] — Token 效率 (Token Efficiency)
- [[token-level-policy-gradient]] — Token 级策略梯度 (Token-Level Policy Gradient)
- [[token-market-dynamics]] — Token Market Dynamics
- [[token-position-decay]] — Token Position-Decay (TPD)
- [[token-security-economics]] — Token Security Economics
- [[token-shift]] — Token Shift
- [[token-superposition-training]] — Token Superposition Training (TST)
- [[token-wise-routing]] — 逐Token路由 (Token-Wise Routing)
- [[tool-bootstrapped-rft]] — Tool-Bootstrapped GUI RFT
- [[tool-efficient-path-reward]] — Tool-Efficient Path Reward
- [[tool-interface]] — Tool Interface & Protocol Layer（工具接口与协议层）
- [[tool-registry]] — 工具注册表 — Tool Registry
- [[tpp-applications]] — TPP 应用场景
- [[tpp-training-methods]] — TPP 训练方法
- [[trace-native-evaluation]] — Trace-Native Evaluation（踪迹原生评估）
- [[trading-lifecycle-driven-eviction]] — Trading-Lifecycle Driven Eviction (交易生命周期驱动淘汰)
- [[trajectory-auditing]] — Trajectory Auditing
- [[trajectory-balance-objective]] — Trajectory Balance (TB) 目标
- [[trajectory-regulation-layer]] — Trajectory Regulation Layer（轨迹调控层）
- [[trajectory-supervision]] — Trajectory Supervision
- [[trajectory-synthesis]] — 轨迹合成 — Trajectory Synthesis
- [[transfer-learning]] — Transfer Learning (迁移学习)
- [[trm-preference-dataset]] — TRM-Preference Dataset
- [[two-phase-pretraining]] — Two-Phase Pre-Training
- [[two-time-scale-process]] — 双时间尺度过程 (Two Time-Scale Process)
- [[type-safety-in-agents]] — Agent 类型安全 (Type Safety in Agents)
- [[typeadapter]] — TypeAdapter
- [[ultradata]] — UltraData
- [[uncancelled-interaction-effects]] — 未抵消交互效应 (Uncancelled Interaction Effects)
- [[uncertainty-disparity-ratio]] — 不确定性差异比 (Uncertainty Disparity Ratio, UDR)
- [[uncertainty-equity-gap]] — 不确定性公平性差距 (Uncertainty Equity Gap, UEG)
- [[uncertainty-quantification]] — 不确定性量化 (Uncertainty Quantification)
- [[uncertainty-taxonomy]] — Jordan 不确定性分类法（Uncertainty Taxonomy）
- [[unconditional-generation-latent]] — Unconditional Generation via Latent Reasoning
- [[unified-latent-probe]] — Unified Latent Probe (ULP)
- [[unified-rft]] — 统一拒绝采样微调 (Unified RFT)
- [[universal-approximation-theorem]] — 通用逼近定理 (Universal Approximation Theorem)
- [[unlimited-ocr]] — Unlimited OCR 模型
- [[unscented-kalman-filter]] — 无迹 Kalman 滤波
- [[unsupervised-rlvr]] — 无监督可验证奖励强化学习 (URLVR)
- [[update-magnitude-imbalance]] — GRPO 更新幅度不平衡
- [[upstream-downstream-learning]] — 上游-下游学习 (Upstream-Downstream Learning)
- [[user-memory-bias]] — User Memory Bias
- [[userspace-kernel]] — 用户空间内核
- [[validity-decay]] — Validity Decay
- [[van-der-waerden-theorem]] — van der Waerden Theorem
- [[variational-autoencoder]] — 变分自编码器 (Variational Autoencoder, VAE)
- [[variational-linearized-laplace-approximation]] — 变分线性化 Laplace 近似 (VaLLA)
- [[vector-valued-gating]] — Vector-Valued Gating
- [[verbatim-pre-recall]] — Verbatim Pre-Recall
- [[verification-evaluation]] — Verification & Evaluation（验证与评估）
- [[vertical-llm-knowledge-engineering]] — 垂域 LLM 知识工程 (Vertical LLM Knowledge Engineering)
- [[vicreg]] — VICReg (Variance-Invariance-Covariance Regularization)
- [[visibility-constraint]] — Visibility Constraint (可见性约束)
- [[visual-primitives]] — 视觉原语 (Visual Primitives)
- [[vla-jepa]] — VLA-JEPA (模型)
- [[vla-vision-language-action]] — VLA (Vision-Language-Action)
- [[watanabe-triple]] — Watanabe 三元组 (Watanabe's Triple)
- [[wavemask-wavemix]] — WaveMask / WaveMix
- [[weak-revealing-condition]] — 弱揭示条件 (Weak Revealing Condition)
- [[weighted-spaces]] — 加权空间 (Weighted Spaces)
- [[width-based-scaling]] — Width-Based Scaling（宽度扩展）
- [[wiener-process]] — 维纳过程 (Wiener Process)
- [[wikilinks]] — Wikilinks
- [[window-attention]] — 窗口注意力 (Window Attention)
- [[wkv-time-mixing]] — WKV Time Mixing
- [[world-model-lecun]] — LeCun 世界模型理论
- [[world-models-rl]] — World Models in RL
- [[worst-case-threat-model]] — 最坏情况威胁模型
- [[x-prediction-parameterization]] — x-Prediction Parameterization
- [[zero-cost-proxies]] — Zero-Cost Proxies (ZCP)
- [[zero-data-cold-start]] — 零数据冷启动 (Zero-Data Cold Start)

## Papers

- [[advances-temporal-point-processes-2026]] — Advances in Temporal Point Processes: Bayesian, Neural, and LLM Approaches
- [[agarwal-bayesian-attention-geometry]] — The Bayesian Geometry of Transformer Attention
- [[agent-harness-engineering-survey]] — Agent Harness Engineering: A Survey
- [[arbor-htr-2026]] — Arbor: Hypothesis-Tree Refinement (Jin et al., RUC/MSR, 2026)
- [[bartoldson-tba-2025]] — TBA: 异步轨迹平衡 — 解耦探索与学习以实现快速可扩展的 LLM 后训练
- [[behrouz-memory-caching-rnn]] — Memory Caching: RNNs with Growing Memory
- [[bellman-taylor-score-decoding]] — Bellman–Taylor Score Decoding for MDPs with State-Dependent Feasible Action Sets
- [[chen-token-economics-llm-agents]] — Token Economics for LLM Agents
- [[claw-swe-bench]] — Claw-SWE-Bench: OpenClaw 风格 Agent Harness 的代码任务基准评测
- [[clawless-ai-agent-security]] — ClawLess: AI 代理安全模型
- [[dai-mathforge-2026]] — MathForge: Harder Is Better — 难度感知GRPO与多维度问题改写
- [[dao-transformers-are-ssms-2024]] — Transformers are SSMs: Generalized Models and Efficient Algorithms Through Struc
- [[darlow-ctm-2025]] — Continuous Thought Machines (CTM)
- [[dead-directions-geometric-singular-learning]] — Dead Directions: 几何奇异学习理论
- [[deepseek-v4-million-token-context]] — DeepSeek-V4: 迈向高效百万 Token 上下文智能
- [[dou-cl-bench]] — CL-bench: 上下文学习基准——首篇定义context learning范式的论文
- [[elf-embedded-language-flows]] — ELF: Embedded Language Flows
- [[engram-conditional-memory-2026]] — Engram: Conditional Memory via Scalable Lookup (Cheng et al., PKU/DeepSeek-AI, 2
- [[fei-mcp-zero-2025]] — MCP-Zero：主动工具发现
- [[flex4dhuman]] — Flex4DHuman: 灵活多视角视频扩散用于 4D 人体重建
- [[gan-bifurcation-eos]] — A Bifurcation Theory Framework for Gradient Descent on the Edge of Stability
- [[gan-thinking-based-non-thinking-2026]] — Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybr
- [[gaurav-dynamic-react-2025]] — Dynamic ReAct：大规模 MCP 工具选择
- [[geometric-sae-concepts]] — A Geometric View for Understanding Concept Learning and Neuron Interpretation in
- [[godel-incompleteness-tutorial]] — 哥德尔不完备定理教程
- [[goru-one-pass-to-reason-2025]] — One-Pass to Reason: 多轮推理的高效单遍微调
- [[gram-generative-recursive-reasoning-paper]] — Generative Recursive Reasoning (GRAM)
- [[gu-mamba]] — Mamba: Linear-Time Sequence Modeling with Selective State Spaces
- [[hazare-dcgwm-2026]] — DCGWM: 双通道接地世界建模 — 结构防止目标干扰坍缩
- [[he-urlvr-sharpening-2026]] — How Far Can Unsupervised RLVR Scale LLM Training?
- [[hunyuan-team-cl-bench-life]] — CL-Bench Life: 真实生活上下文学习基准
- [[jordan-collectivist-ai-2025]] — AI 的集体主义经济学视角（Jordan, 2025）
- [[kore-knowledge-injection]] — KORE: Knowledge-Oriented Controls for Knowledge Injection
- [[laban-llms-corrupt-documents-delegate]] — LLMs Corrupt Your Documents When You Delegate
- [[large-language-gibbs]] — Structured Inference with Large Language Gibbs
- [[latent-cot-supervision]] — What Makes Effective Supervision in Latent Chain-of-Thought
- [[li-amd-human-perception]] — "Are You Sure?": Human Perception Vulnerability in LLM Agents
- [[liu-auditing-agent-harness-safety]] — Auditing Agent Harness Safety
- [[liu-koopa-2023]] — Koopa: Koopman 预测器驱动的非平稳时间序列学习
- [[llm-attention-survey-2026]] — 大语言模型注意力机制全面分析
- [[longmem-eval-2025]] — LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory (Wu et
- [[lou-autoharness-2026]] — AutoHarness: LLM Agent 的自动代码 Harness 合成
- [[ma-intragent-2026]] — IntrAgent: Content-Grounded Literature Information Retrieval
- [[maes-leworldmodel-2026]] — LeWorldModel: Stable End-to-End JEPA from Pixels
- [[maineCoon]] — MaineCoon: Real-Time Audio-Visual Social World Model
- [[me2-trm-reasoning-2026]] — ME² + TRM: Complex Reasoning Optimization (Zhang et al., ICML 2026)
- [[minimax-policy-regret-pomg]] — Minimax-Optimal Policy Regret in Partially Observable Markov Games
- [[mozer-topological-trouble-transformers-2026]] — The Topological Trouble With Transformers
- [[nano-filter]] — NANO Filter: 非线性贝叶斯滤波的自然梯度高斯近似
- [[nikolopoulos-spurious-predictability]] — Spurious Predictability in Financial Machine Learning
- [[niu-stem-causal-sparse-attention]] — Stem: Rethinking Causal Information Flow in Sparse Attention
- [[odrzywolek-eml-single-operator]] — All elementary functions from a single binary operator
- [[onereason]] — OneReason: 生成式推荐中的推理能力解锁
- [[ortega-phd-thesis]] — Uncertainty Estimation and Generalization Bounds for Modern Deep Learning
- [[peng-rwkv7]] — RWKV-7 Goose: Expressive Dynamic State Evolution
- [[peng-tst-2026]] — Token Superposition Training: 高效 LLM 预训练的 Token 叠加方法
- [[personalization-trap-2025]] — The Personalization Trap (Fang et al., Amazon, 2025)
- [[pre-train-space-reinforcement-learning]] — Pre-train Space Reinforcement Learning (PreRL/DSRL)
- [[predictive-representations-scalable-mtrl]] — 预测表征驱动可扩展多任务深度强化学习
- [[principled-uncertainty-clinical-ai]] — Principled Uncertainty in Clinical AI: Bayesian Modelling and Equity Auditing
- [[procedural-skills-to-strategy-genes]] — From Procedural Skills to Strategy Genes: Towards Experience-Driven Test-Time Ev
- [[qin-prfaas-cross-datacenter]] — Prefill-as-a-Service: KVCache Goes Cross-Datacenter
- [[ramsey-numbers-survey]] — 拉姆齐数的数学综述
- [[relu-neuromanifolds-semi-algebraicity]] — ReLU 神经流形的纤维与半代数性
- [[repmt-sac]] — Learning to Adapt: Representation-Based RL for Multi-Task Skill Transfer
- [[song-agent-network-taxonomy]] — Complex networks of AI agentic systems: 拓扑-记忆-更新三层分类法
- [[streaming-llm]] — StreamingLLM: 基于注意力汇的高效流式语言模型
- [[tang-lukv]] — LU-KV: Predicting Future Utility for KV Cache Eviction
- [[tao-klowden-ai-mathematical-methods]] — Mathematical methods and human thought in the age of AI
- [[tarpo]] — TARPO: Token-Wise Latent-Explicit Reasoning via Action-Routing Policy Optimizati
- [[thinking-with-visual-primitives]] — Thinking with Visual Primitives — 以视觉原语思考
- [[ticks-to-flows]] — From Ticks to Flows: Dynamics of Neural RL in Continuous Environments
- [[toolcua-optimal-gui-tool-orchestration]] — ToolCUA: Optimal GUI-Tool Path Orchestration for Computer Use Agents
- [[unlimited-ocr-works-2026]] — Unlimited OCR Works (Yin et al., Baidu, 2026)
- [[vla-jepa-2026]] — VLA-JEPA (Sun et al., 2026)
- [[vu-fisher-width-2026]] — Fisher Width: 统计流形上的几何复杂度度量
- [[wan-streamer]] — Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models
- [[weighted-uat-manifolds]] — Weighted Universal Approximation of Differentiable Maps on Infinite-Dimensional
- [[when-large-multimodal-models-confront-evolving-knowledge]] — When Large Multimodal Models Confront Evolving Knowledge
- [[xing-trails-2024]] — Trails: Database Native Model Selection (VLDB 2024)
- [[xu-life-harness]] — Adapting the Interface, Not the Model: Runtime Harness Adaptation for Determinis
- [[xu-why-steering-works]] — Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics
- [[yang-skillopt-2026]] — SkillOpt: Agent Skill 的文本空间优化器
- [[yao-ace-router-2026]] — ACE-Router：历史感知路由
- [[zeng-dynamic-model-slicing-2024]] — Powering In-Database Dynamic Model Slicing for Structured Data Analytics (VLDB 2
- [[zeng-neurida-2025]] — NeurIDA: Dynamic Modeling for Effective In-Database Analytics
- [[zhang-hyperagents]] — Hyperagents: Self-Referential Agents with Metacognitive Self-Modification
- [[zhang-reconciling-sft-interaction-2026]] — Reconciling Contradictory Views on the Effectiveness of SFT in LLMs
- [[zhao-neurdb-2025]] — NeurDB: On the Design and Implementation of an AI-powered Autonomous Database (C
- [[zhou-agent-skills-survey-2026]] — A Comprehensive Survey on Agent Skills — 综述
- [[zhou-agent-symbolic-learning-2024]] — Agent Symbolic Learning: 用符号学习实现自进化 Agent
- [[zhu-moda-mixture-of-depths]] — Mixture-of-Depths Attention (MoDA)

## Articles

- [[atlas-agent-memory-architecture-2026]] — Atlas Agent 记忆系统架构（2026）
- [[caddy-reverse-proxy-auth]] — Caddy 反向代理认证方案
- [[cantor-stole-infinity]] — 窃取无穷的数学家 — 康托尔与狄德金的隐秘合作
- [[claw-eval]] — Claw-Eval：面向自主Agent的端到端评测框架
- [[crawl4ai-open-source-web-crawler]] — Crawl4AI：赋能AI用户的开源智能网页爬虫与数据提取工具
- [[distributed-agent-cache-sync-2026]] — 分布式Agent缓存同步：从单机到多机的Prompt Caching架构升级
- [[financial-llm-practice-2026]] — 金融行业大模型落地实践（林金曙，2026）
- [[gpt-image2-prompt-collection]] — GPT-Image-2 绘图 Prompt 方法论与风格合集
- [[lecun-llm-boundary-future]] — LeCun 论 LLM 的边界与未来架构
- [[llm-spiral-of-silence-2026]] — LLM 沉默螺旋：算法催生的数字从众
- [[lyu-model-harness-evolution-2026]] — Model与Harness的关系演进：从AutoHarness到Heuristic Learning
- [[lyu-skillopt-deep-dive-2026]] — SkillOpt深度解读：自进化Agent技能的'反向传播'与工程化Continued Evolve
- [[memtensor-memos-agent-memory-2026]] — MemOS：Agent 记忆基础设施
- [[michael-jordan-mlst-collectivist-ai-2026]] — Michael I. Jordan：AI 的集体主义经济学与虚假的 AGI 二元论
- [[mini-agent-harness]] — 从零搭建 Mini Agent Harness
- [[nobrega-ai-production-tradeoffs-2026]] — AI 工程师的 6 种生产权衡
- [[oppo-multimodal-data-lake]] — OPPO 多模态数据湖架构实践
- [[prompt-caching-architecture]] — Prompt Caching 架构工程手册
- [[pydantic-three-piece-suite]] — Pydantic 三件套：从校验库到 AI 基础设施
- [[qifu-llm-finance-practice]] — 金融行业大模型落地实践：从知识工程到后训练部署
- [[ramsey-context-construction]] — 上下文构造与拉姆齐数
- [[temporal-patch-shuffle-tps]] — 时序预测增强方法综述：从频域到 TPS
- [[ultradata-l3-open-source-2026]] — UltraData：面壁智能L3数据开源与数据分级治理体系

## Special Pages

- [[index]] —
- [[log]] —
- [[README]] —
- [[SCHEMA]] —

## Reviews

- [[ace-router-review-20260619]] — ACE-Router Review
- [[advances-temporal-point-processes-review-20260616]] — Review: Advances in Temporal Point Processes
- [[agent-harness-engineering-review-20260523]] — Review: Agent Harness Engineering Survey
- [[agent-network-taxonomy-review-20260501]] — agent-network-taxonomy-review-20260501
- [[agent-skills-survey-review-20260619]] — Agent Skills Survey Review
- [[arbor-htr-20260624]] — Review: Arbor — Autonomous Research via Hypothesis-Tree Refinement
- [[auditing-agent-harness-safety-review-20260605]] — Auditing Agent Harness Safety — Review
- [[btsd-review-20260617]] — Bellman-Taylor Score Decoding 论文集成 Review
- [[cantor-stole-infinity-2026-06-07]] — 窃取无穷的数学家 — 康托尔与狄德金的历史真相
- [[cl-bench-life-review-20260501]] — CL-Bench Life 论文集成 Review
- [[cl-bench-review-20260501]] — cl-bench-review-20260501
- [[claw-swe-bench-review-20260615]] — Claw-SWE-Bench 论文集成 Review
- [[clawless-review-20260422]] — ClawLess: AI 代理安全模型 - Review 报告
- [[ctm-review-20260515]] — Continuous Thought Machines 论文集成 Review
- [[dao-transformers-are-ssms-review-20260618]] — Review: Transformers are SSMs (Mamba-2)
- [[dcgwm-2026-06-23]] — Review: DCGWM — 结构防止目标干扰坍缩的双通道接地世界建模
- [[dead-directions-20260610]] — Review: Dead Directions — Geometric Singular Learning
- [[delegate52-review-20260514]] — DELEGATE-52 Review
- [[distributed-agent-cache-sync-review]] — Review: 分布式Agent缓存同步
- [[dynamic-react-review-20260619]] — Dynamic ReAct Review
- [[elf-embedded-language-flows-review-20260513]] — Review: ELF — Embedded Language Flows
- [[engram-conditional-memory-20260625]] — Engram Review — 条件记忆作为 Transformer 的新稀疏轴
- [[fisher-width-2026-06-23]] — Review: Fisher Width — 统计流形上的几何复杂度
- [[flex4dhuman-review-20260613]] — Review: Flex4DHuman — 无几何先验的多视角视频扩散
- [[gan-bifurcation-eos-20260623]] — Review: Gan Bifurcation EoS
- [[gan-tnt-review-20260618]] — Review: Thinking-Based Non-Thinking (TNT)
- [[geometric-sae-review-20260617]] — Geometric SAE 论文集成 Review
- [[godel-tutorial-review-20260428]] — 哥德尔不完备定理教程 — Review 报告
- [[hyperagents-review-20260420]] — 📚 Wiki 添加 Review 报告 - Hyperagents 论文
- [[jordan-collectivist-ai-review-20260621]] — Review: A Collectivist, Economic Perspective on AI
- [[koopa-review-20260511]] — Review: Koopa — Koopman 预测器驱动的非平稳时序学习
- [[kore-review-20260521]] — KORE Review
- [[large-language-gibbs-2026-06-25]] — Large Language Gibbs Review
- [[latent-cot-supervision-2026-06-25]] — Latent CoT Supervision Review
- [[lecun-llm-20260608]] — Review: LeCun 论 LLM 的边界与未来架构
- [[leworldmodel-20260608]] — Review: LeWorldModel (arXiv:2603.19312)
- [[life-harness-review-20260611]] — Life-Harness — Runtime Harness Adaptation 论文 Review
- [[llm-attention-survey-review-20260429]] — Review: 大语言模型注意力机制全面分析
- [[longmem-eval-20250625]] — LongMemEval Review — 长期交互记忆的系统性评测框架
- [[lou-autoharness-review]] — Review: AutoHarness — 自动合成代码 Harness 改进 LLM Agent
- [[lukv-review-20260618]] — Review: LU-KV — Global Combinatorial Optimization for KV Cache Eviction
- [[lyu-model-harness-review]] — Review: Model与Harness的关系演进
- [[lyu-skillopt-deep-dive-review]] — Review: SkillOpt深度解读 — 自进化Agent的'反向传播'
- [[ma-intragent-review-20260604]] — IntrAgent — Content-Grounded Literature Retrieval Review
- [[mainecoon-review-20260620]] — MaineCoon Review
- [[mamba-review-20260618]] — Review: Mamba — Linear-Time Sequence Modeling with Selective State Spaces
- [[mathforge-review-20260512]] — MathForge Review — 2026-05-12
- [[mcp-zero-review-20260619]] — MCP-Zero Review
- [[me2-trm-reasoning-20260624]] — Review: ME² + TRM — Complex Reasoning Optimization
- [[minimax-policy-regret-pomg-20260610]] — Review: Minimax-Optimal Policy Regret in POMGs
- [[mozer-topological-trouble-review-20260618]] — Review: The Topological Trouble With Transformers
- [[nano-filter-20260622]] — NANO Filter Review
- [[neurida-review-20260515]] — NeurIDA 论文集成 Review
- [[one-pass-to-reason-review-20260602]] — Review: One-Pass to Reason — 多轮推理的高效单遍微调
- [[onereason-review-20260610]] — OneReason Review — 生成式推荐的推理能力解锁
- [[ortega-phd-review-20260617]] — Ortega PhD Thesis 集成 Review
- [[peng-tst-2026-review]] — Review: Token Superposition Training
- [[personalization-trap-20260624]] — Review: The Personalization Trap
- [[predictive-representations-mtrl-20260610]] — Review: Predictive Representations for Scalable Multitask Deep RL
- [[pretrain-space-rl-review-20260518]] — Review: Pre-train Space Reinforcement Learning
- [[principled-uncertainty-clinical-ai-20260610]] — Review: Principled Uncertainty in Clinical AI
- [[prompt-caching-architecture-review-20260511]] — Review: Prompt Caching 架构工程手册
- [[pydantic-three-piece-review-20260610]] — Pydantic 三件套 Review — 从校验库到 AI 基础设施
- [[ramsey-context-construction-review-20260511]] — Review: 上下文构造与拉姆齐数
- [[ramsey-numbers-survey-review-20260511]] — Review: 拉姆齐数的数学综述
- [[relu-neuromanifolds-20260610]] — Review: ReLU Neuromanifolds — Fibers and Semi-algebraicity
- [[repmt-sac-review-20260617]] — RepMT-SAC 论文集成 Review
- [[rwkv7-review-20260618]] — Review: RWKV-7 Goose — Expressive Dynamic State Evolution
- [[skills-to-genes-review-20260614]] — Skills to Strategy Genes — Review 报告
- [[stem-causal-sparse-attention-review-20260605]] — Stem: Rethinking Causal Information Flow in Sparse Attention — Review
- [[streaming-llm-review-20260514]] — Review: StreamingLLM — 基于注意力汇的无限长流式语言模型
- [[tarpo-review-20260617]] — TARPO 论文集成 Review
- [[tba-review-20260512]] — TBA Review — 2026-05-12
- [[thinking-with-visual-primitives-review-20260430]] — Review — Thinking with Visual Primitives
- [[ticks-to-flows-review-20260617]] — Ticks-to-Flows 论文集成 Review
- [[token-economics-review-20260605]] — Token Economics for LLM Agents — Review
- [[toolcua-review-20260531]] — ToolCUA Review: GUI-Tool路径编排的概念网络分析
- [[ultradata-l3-review]] — Review: UltraData — 大模型数据分级治理的开源实践
- [[unlimited-ocr-works-20260624]] — Review: Unlimited OCR Works
- [[vla-jepa-20260624]] — Review: VLA-JEPA
- [[wan-streamer-2026-06-25]] — Wan-Streamer v0.1 Review
- [[weighted-uat-review-20260617]] — Weighted UAT 论文集成 Review
- [[xu-why-steering-works-review-20260601]] — Review: Why Steering Works — 参数动态统一视角
- [[yang-skillopt-review]] — Review: SkillOpt — Agent Skill 的文本空间优化器
- [[zhang-sft-interaction-review-20260603]] — Review: Reconciling Contradictory Views on SFT in LLMs — 交互视角
- [[zhou-agent-symbolic-learning-review]] — Review: Agent Symbolic Learning — 符号学习驱动的自进化Agent