Megatron-LM

Continuous research on training Transformer models at scale.

CommonProductProductivityTransformerLanguage Model
Megatron-LM is a powerful large-scale Transformer model developed by NVIDIA's Applied Deep Learning Research team. It is used in continuous research on training Transformer language models at scale. We utilize mixed precision, efficient model parallelism and data parallelism, along with the pre-training of multi-node Transformer models such as GPT, BERT, and T5.
Visit

Megatron-LM Visit Over Time

Monthly Visits

503747431

Bounce Rate

37.31%

Page per Visit

5.7

Visit Duration

00:06:44

Megatron-LM Visit Trend

Megatron-LM Visit Geography

Megatron-LM Traffic Sources

Megatron-LM Alternatives