20260514:增加新内容

This commit is contained in:
2026-05-14 13:54:52 +08:00
parent 56c4d3ef7c
commit b116710e4c
294 changed files with 10682 additions and 255 deletions

View File

@@ -55,5 +55,5 @@ $$\text{MoDA}(Q_l) = \text{Softmax}\left(\frac{Q_l [K_{l-D:l}]^T}{\sqrt{d}}\righ
## 相关概念
- [[zhu-moda-mixture-of-depths]] — 原始论文
- [[depth-scaling-llms]] — LLM 深度扩展
- [[signal-degradation]] — 信号退化问题
- [[depth-scaling-signal-degradation]] — LLM 深度扩展
- [[depth-scaling-signal-degradation]] — 信号退化问题