YaFSDP

An efficient distributed data parallelism framework designed for large language models.

CommonProductProgrammingDistributed ComputingData Parallelism
YaFSDP is a distributed data parallelism framework designed to work well with transformer-like neural network architectures. It is 20% faster than the traditional FSDP when pre-training large language models (LLMs) and performs better under high-memory pressure conditions. YaFSDP aims to reduce the overhead of communication and memory operations.
Visit

YaFSDP Visit Over Time

Monthly Visits

499904316

Bounce Rate

37.31%

Page per Visit

5.8

Visit Duration

00:06:52

YaFSDP Visit Trend

YaFSDP Visit Geography

YaFSDP Traffic Sources

YaFSDP Alternatives