---
title: "Megatron-LM"
created: 2026-06-24
updated: 2026-06-24
type: concept
tags: ["training-framework", "distributed-training", "model-parallelism"]
sources:
- "[[unlimited-ocr-works-2026]]"
---
# Megatron-LM
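Megatron-LM (NVIDIA, arXiv:1909.08053) is a distributed training framework for training large language models with billions of parameters. It provides model-parallel strategies such as tensor parallelism and pipeline parallelism. The Unlimited OCR training pipeline is built on Megatron-LM.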
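
To make the idea of tensor parallelism concrete, here is a minimal sketch in plain Python. It is not Megatron-LM code: the function names (`matmul`, `split_columns`, `column_parallel_matmul`) are hypothetical, and the "devices" are simulated as list slices. It assumes a column-wise split of the weight matrix, where each shard computes its part of the output independently and the parts are then concatenated.

```python
# Illustrative sketch of column-wise tensor parallelism (not Megatron-LM code).
# Each simulated "device" holds one contiguous block of the weight's columns.

def matmul(x, w):
    """Multiply x (n x k) by w (k x m); both are lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*w)] for row in x]

def split_columns(w, num_shards):
    """Split w into num_shards contiguous column blocks of equal width."""
    cols = len(w[0])
    assert cols % num_shards == 0, "columns must divide evenly across shards"
    width = cols // num_shards
    return [[row[i * width:(i + 1) * width] for row in w] for i in range(num_shards)]

def column_parallel_matmul(x, w, num_shards):
    """Compute x @ w shard by shard, then concatenate the partial outputs.

    The concatenation stands in for the gather step a real framework
    would perform across devices.
    """
    partials = [matmul(x, shard) for shard in split_columns(w, num_shards)]
    return [sum((p[r] for p in partials), []) for r in range(len(x))]

x = [[1, 2], [3, 4]]
w = [[1, 0, 2, 1], [0, 1, 1, 3]]

# The sharded result matches the unsharded product.
assert column_parallel_matmul(x, w, 2) == matmul(x, w)
```

Because every shard needs only its own columns of the weight matrix, the memory for that weight is divided across shards, at the cost of a communication step to reassemble the output.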
## References
- [[unlimited-ocr-works-2026]]