mindformers.modules.transformer.EmbeddingOpParallelConfig¶
-
class
mindformers.modules.transformer.EmbeddingOpParallelConfig(data_parallel=1, model_parallel=1, vocab_emb_dp=True)[源代码]¶ The parallel config of
VocabEmbeddingfor the setting data parallel or model parallel for the embedding table.- 参数
data_parallel (int) – The data parallel way. The input data will be sliced into n parts for embedding layer according to this value. Default: 1.
model_parallel (int) – The model parallel way. The embedding table parameters will be sliced at 0-th axis according to the model parallel way. Default: 1.
vocab_emb_dp (bool) – Shard embedding in model parallel or data parallel. If True, the embedding lookup will be a data parallel style training and model_parallel value will be ignored. If false, the embedding table will be sharded into n parts at the 0-th dimension row slice of the embedding table, where the n is the model parallel way determined by this parameter. Default: True
- Supported Platforms:
AscendGPU
实际案例
>>> from mindformers.modules.transformer import EmbeddingOpParallelConfig >>> config=EmbeddingOpParallelConfig(data_parallel=1, model_parallel=1, vocab_emb_dp=True)