| title | created | updated | type | tags | sources |
| --- | --- | --- | --- | --- | --- |
| Megatron-LM | 2026-06-24 | 2026-06-24 | concept | training-framework, distributed-training, model-parallelism | unlimited-ocr-works-2026 |

Megatron-LM

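Megatron-LM (NVIDIA, arXiv:1909.08053) is a distributed training framework for training large language models with billions of parameters. It provides model-parallel strategies such as tensor parallelism and pipeline parallelism. The Unlimited OCR training pipeline is built on Megatron-LM.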
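To make the idea of tensor parallelism concrete, here is a minimal single-process sketch of how a two-layer MLP block can be sharded in the style described in the Megatron-LM paper: the first weight matrix is split by columns, the second by rows, and the per-shard outputs are summed. It does not use the Megatron-LM API; the helper names (`split_columns`, `split_rows`, `mlp_tensor_parallel`) and the toy matrices are assumptions made for illustration, and the "all-reduce" is simulated with a plain sum rather than real inter-GPU communication.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gelu(m):
    """Exact GeLU applied elementwise."""
    return [[0.5 * v * (1.0 + math.erf(v / math.sqrt(2.0))) for v in row] for row in m]


def split_columns(m, parts):
    """Split a matrix into `parts` equal column blocks (hypothetical helper)."""
    width = len(m[0]) // parts
    return [[row[i * width:(i + 1) * width] for row in m] for i in range(parts)]


def split_rows(m, parts):
    """Split a matrix into `parts` equal row blocks (hypothetical helper)."""
    height = len(m) // parts
    return [m[i * height:(i + 1) * height] for i in range(parts)]


def mlp_reference(x, a, b):
    """Unsharded MLP: Z = GeLU(X A) B."""
    return matmul(gelu(matmul(x, a)), b)


def mlp_tensor_parallel(x, a, b, world_size):
    """Sharded MLP: each simulated rank holds one column block of A and one row block of B."""
    a_shards = split_columns(a, world_size)
    b_shards = split_rows(b, world_size)
    # Each rank computes GeLU(X A_k) B_k locally; GeLU is elementwise,
    # so no communication is needed between the two matmuls.
    partials = [matmul(gelu(matmul(x, a_k)), b_k) for a_k, b_k in zip(a_shards, b_shards)]
    # Simulated all-reduce: sum the partial outputs from all ranks.
    rows, cols = len(partials[0]), len(partials[0][0])
    return [[sum(p[i][j] for p in partials) for j in range(cols)] for i in range(rows)]


# Toy inputs (assumed values): batch of 2, hidden size 2, intermediate size 4.
X = [[1.0, -2.0], [0.5, 3.0]]
A = [[0.1, -0.2, 0.3, 0.4], [0.5, 0.6, -0.7, 0.8]]
B = [[1.0, 0.0], [0.0, 1.0], [0.5, -0.5], [-1.0, 2.0]]

reference = mlp_reference(X, A, B)
sharded = mlp_tensor_parallel(X, A, B, world_size=2)
```

The sharded result matches the unsharded one up to floating-point rounding, which is why this split needs only one reduction per MLP block in the forward pass. In a real setup each shard would live on a separate GPU and the final sum would be a collective operation.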

References