20260625: lots of new content

2026-06-25 14:08:47 +08:00
parent 91fac5b6fc
commit 6021dea160
375 changed files with 19263 additions and 251 deletions

concepts/megatron-lm.md Normal file

@@ -0,0 +1,16 @@
---
title: "Megatron-LM"
created: 2026-06-24
updated: 2026-06-24
type: concept
tags: ["training-framework", "distributed-training", "model-parallelism"]
sources:
- "[[unlimited-ocr-works-2026]]"
---
# Megatron-LM
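Megatron-LM (NVIDIA, arXiv:1909.08053) is a distributed training framework for training large language models with billions of parameters. It provides model-parallel strategies such as tensor parallelism and pipeline parallelism. The Unlimited OCR training pipeline is built on Megatron-LM.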
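
To make the idea of tensor parallelism concrete, here is a minimal pure-Python sketch of a column-wise weight split. It is not Megatron-LM's API: the helper names (`matmul`, `split_columns`, `column_parallel_matmul`) are hypothetical, and the "devices" are simulated as list entries in one process. It only shows that splitting a weight matrix by columns, multiplying each shard independently, and concatenating the partial outputs gives the same result as the unsplit multiplication.

```python
def matmul(x, w):
    """Multiply x (n x k) by w (k x m); both are lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*w)]
            for row in x]


def split_columns(w, parts):
    """Split w column-wise into `parts` equal shards, one per simulated device."""
    cols = len(w[0])
    assert cols % parts == 0, "column count must divide evenly across shards"
    step = cols // parts
    return [[row[i * step:(i + 1) * step] for row in w] for i in range(parts)]


def column_parallel_matmul(x, w, parts):
    """Compute x @ w by multiplying against each column shard separately."""
    shards = split_columns(w, parts)
    # In a real framework each partial product would run on its own GPU.
    partials = [matmul(x, shard) for shard in shards]
    # Concatenate the partial outputs along the column axis (an all-gather).
    return [sum((p[r] for p in partials), []) for r in range(len(x))]


x = [[1, 2], [3, 4]]
w = [[1, 0, 2, 1], [0, 1, 1, 3]]

full = matmul(x, w)
sharded = column_parallel_matmul(x, w, parts=2)
assert full == sharded
```

Each shard holds only half of the weight columns here, which is the memory saving that motivates splitting a layer across devices. The cost is the communication step needed to gather the partial outputs.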
## References
- [[unlimited-ocr-works-2026]]