mindformers.modules.transformer.EmbeddingOpParallelConfig¶
- class mindformers.modules.transformer.EmbeddingOpParallelConfig(data_parallel=1, model_parallel=1, use_seq_parallel=False, select_recompute=False, vocab_emb_dp=True)[源代码]¶
The parallel config of
VocabEmbeddingfor the setting data parallel or model parallel for the embedding table.- Args:
- data_parallel(int): The data parallel way. The input data will be sliced into n parts for embedding layer
according to this value. Default: 1.
- model_parallel(int): The model parallel way. The embedding table parameters
will be sliced at 0-th axis according to the model parallel way. Default: 1.
- vocab_emb_dp(bool): Shard embedding in model parallel or data parallel. If True, the embedding lookup
will be a data parallel style training and model_parallel value will be ignored. If false, the embedding table will be sharded into n parts at the 0-th dimension row slice of the embedding table, where the n is the model parallel way determined by this parameter. Default: True
- Supported Platforms:
AscendGPU- Examples:
>>> from mindformers.modules.transformer import EmbeddingOpParallelConfig >>> config=EmbeddingOpParallelConfig(data_parallel=1, model_parallel=1, vocab_emb_dp=True)