# 模型支持列表 ## NLP ### masked_language_modeling | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :--------------------------------------: | :-----------------: | :------------------: | :-----------------: | :--------------------------------------------------------------------------------------------------------------------: | | [bert_base_uncased](model_cards/bert.md) | wiki | - | - | [run_bert_base_uncased.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/bert/run_bert_base_uncased.yaml) | ### [text_classification](task_cards/text_classification.md) | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :-----------------------------------------------------------------------------------------------------------------------------------: | :-----------------: | :----------------------: | :-----------------: | :------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | [txtcls_bert_base_uncased](task_cards/text_classification.md)
[txtcls_bert_base_uncased_mnli](task_cards/text_classification.md) | Mnli
Mnli | Entity F1
Entity F1 | -
84.80% | [run_txtcls_bert_base_uncased.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/txtcls/run_txtcls_bert_base_uncased.yaml)
[run_txtcls_bert_base_uncased_mnli.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/txtcls/run_txtcls_bert_base_uncased_mnli.yaml) | ### [token_classification](task_cards/token_classification.md) | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :----------------------------------------------------------------------------------------------------------------------------------------: | :------------------: | :----------------------: | :-----------------: | :------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | [tokcls_bert_base_chinese](task_cards/token_classification.md)
[tokcls_bert_base_chinese_cluener](task_cards/token_classification.md) | CLUENER
CLUENER | Entity F1
Entity F1 | -
0.7905 | [run_tokcls_bert_base_chinese.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/tokcls/run_tokcls_bert_base_chinese.yaml)
[run_tokcls_bert_base_chinese_cluener.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/tokcls/run_tokcls_bert_base_chinese_cluener.yaml) | ### [question_answering](task_cards/question_answering.md) | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :----------------------------------------------------------------------------------------------------------------------------: | :------------------------: | :------------------: | :------------------: | :------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | [qa_bert_base_uncased](task_cards/question_answering.md)
[qa_bert_base_chinese_uncased](task_cards/question_answering.md) | SQuAD v1.1
SQuAD v1.1 | EM / F1
EM / F1 | 80.74 / 88.33
- | [run_qa_bert_base_uncased.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/qa/run_qa_bert_base_uncased.yaml)
[run_qa_bert_base_chinese_uncased.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/qa/run_qa_bert_base_chinese_uncased.yaml) | ### translation | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :---------------------------: | :-----------------: | :------------------: | :-----------------: | :------------------------------------------------------------------------------------------------------------------: | | [t5_small](model_cards/t5.md) | WMT16 | - | - | [run_t5_small_on_wmt16.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/t5/run_t5_small_on_wmt16.yaml) | ### [text_generation](task_cards/text_generation.md) | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :----------------------------------------------------------------------------------------------------------------------------------------------------: | :--------------------------------------------: | :------------------------------------------: | :----------------------------------------------------------: | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | [bloom_560m](model_cards/bloom.md)
[bloom_7.1b](model_cards/bloom.md)
[bloom_65b](model_cards/bloom.md)
[bloom_176b](model_cards/bloom.md) | alpaca
alpaca
alpaca
alpaca | -
-
-
- | -
-
-
- | [run_bloom_560m.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/bloom/run_bloom_560m.yaml)
[run_bloom_7.1b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/bloom/run_bloom_7.1b.yaml)
[run_bloom_65b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/bloom/run_bloom_65b.yaml)
[run_bloom_176b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/bloom/run_bloom_176b.yaml) | | [glm_6b](model_cards/glm.md)
[glm_6b_lora](model_cards/glm.md) | ADGEN
ADGEN | BLEU-4 / Rouge-1 / Rouge-2 / Rouge-l
- | 8.42 / 31.75 / 7.98 / 25.28
- | [run_glm_6b_finetune.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/glm/run_glm_6b_finetune.yaml)
[run_glm_6b_lora.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/glm/run_glm_6b_lora.yaml) | | [glm2_6b](model_cards/glm2.md)
[glm2_6b_lora](model_cards/glm2.md) | ADGEN
ADGEN | BLEU-4 / Rouge-1 / Rouge-2 / Rouge-l
- | 7.47 / 30.78 / 7.07 / 24.77
7.23 / 31.06 / 7.18 / 24.23 | [run_glm2_6b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/glm2/run_glm2_6b.yaml)
[run_glm2_6b_lora.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/glm2/run_glm2_6b_lora.yaml) | | [gpt2_small](model_cards/gpt2.md)
[gpt2_13b](model_cards/gpt2.md)
[gpt2_52b](model_cards/gpt2.md) | wikitext-2
wikitext-2
wikitext-2 | -
-
- | -
-
- | [run_gpt2.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/gpt2/run_gpt2.yaml)
[run_gpt2_13b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/gpt2/run_gpt2_13b.yaml)
[run_gpt2_52b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/gpt2/run_gpt2_52b.yaml) | | [llama_7b](model_cards/llama.md)
[llama_13b](model_cards/llama.md)
[llama_65b](model_cards/llama.md)
[llama_7b_lora](model_cards/llama.md) | alpaca
alpaca
alpaca
- | -
-
-
- | -
-
-
- | [run_llama_7b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_7b.yaml)
[run_llama_13b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_13b.yaml)
[run_llama_65b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_65b.yaml)
[run_llama_7b_lora.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_7b_lora.yaml) | | [llama_7b](model_cards/llama.md)
[llama_13b](model_cards/llama.md)
[llama_65b](model_cards/llama.md)
[llama_7b_lora](model_cards/llama.md) | alpaca
alpaca
alpaca
- | -
-
-
- | -
-
-
- | [run_llama_7b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_7b.yaml)
[run_llama_13b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_13b.yaml)
[run_llama_65b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_65b.yaml)
[run_llama_7b_lora.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/llama/run_llama_7b_lora.yaml) | | [pangualpha_2_6_b](model_cards/pangualpha.md)
[pangualpha_13b](model_cards/pangualpha.md) | 悟道数据集
悟道数据集 | -
- | -
- | [run_pangualpha_2_6b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/pangualpha/run_pangualpha_2_6b.yaml)
[run_pangualpha_13b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/pangualpha/run_pangualpha_13b.yaml) | | [baichuan_7b](../research/baichuan/baichuan.md)
[baichuan_13b](../research/baichuan/baichuan.md) | -
- | -
- | -
- | [run_baichuan_7b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/research/baichuan/run_baichuan_7b.yaml)
[run_baichuan_13b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/research/baichuan/run_baichuan_13b.yaml) | | [baichuan2_7b](../research/baichuan2/baichuan2.md)
[baichuan2_13b](../research/baichuan2/baichuan2.md) | -
- | -
- | -
- | [run_baichuan2_7b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/research/baichuan2/run_baichuan2_7b.yaml)
[run_baichuan2_13b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/research/baichuan2/run_baichuan2_13b.yaml) | | [internlm_7b](../research/internlm/internlm.md)
[internlm_7b_lora](../research/internlm/internlm.md) | wikitext-2
alpaca | -
- | -
- | [run_internlm_7b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/research/internlm/run_internlm_7b.yaml)
[run_internlm_7b_lora.yaml](https://gitee.com/mindspore/mindformers/blob/dev/research/internlm/run_internlm_7b_lora.yaml) | | [ziya_13b](../research/ziya/ziya.md) | -
- | -
- | -
- | [run_ziya_13b.yaml](https://gitee.com/mindspore/mindformers/blob/dev/research/baichuan/run_ziya_13b.yaml) | ## CV ### masked_image_modeling | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :------------------------------------: | :-----------------: | :------------------: | :-----------------: | :-------------------------------------------------------------------------------------------------------------------------------------: | | [mae_vit_base_p16](model_cards/mae.md) | ImageNet-1k | - | - | [run_mae_vit_base_p16_224_800ep.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/mae/run_mae_vit_base_p16_224_800ep.yaml) | ### [image_classification](task_cards/image_classification.md) | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :-----------------------------------: | :-----------------: | :------------------: | :-----------------: | :----------------------------------------------------------------------------------------------------------------------------------: | | [vit_base_p16](model_cards/vit.md) | ImageNet-1k | Accuracy | 83.71% | [run_vit_base_p16_224_100ep.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/vit/run_vit_base_p16_224_100ep.yaml) | | [swin_base_p4w7](model_cards/swin.md) | ImageNet-1k | Accuracy | 83.44% | [run_swin_base_p4w7_224_100ep.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/swin/run_swin_base_p4w7_224_100ep.yaml) | ## Multi-Modal ### [zero_shot_image_classification](task_cards/zero_shot_image_classification.md) (by [contrastive_language_image_pretrain](task_cards/contrastive_language_image_pretrain.md)) | 模型
model | 数据集
dataset | 评估指标
metric | 评估得分
score | 配置
config | | :---------------------------------------------------------------------------------------------------------------------------------------------------------------: | :------------------------------------------------: | :---------------------------------------------------------: | :----------------------------------------: | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | [clip_vit_b_32](model_cards/clip.md)
[clip_vit_b_16](model_cards/clip.md)
[clip_vit_l_14](model_cards/clip.md)
[clip_vit_l_14@336](model_cards/clip.md) | Cifar100
Cifar100
Cifar100
Cifar100 | Accuracy
Accuracy
Accuracy
Accuracy | 57.24%
61.41%
69.67%
68.19% | [run_clip_vit_b_32_pretrain_flickr8k.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/clip/run_clip_vit_b_32_pretrain_flickr8k.yaml)
[run_clip_vit_b_16_pretrain_flickr8k.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/clip/run_clip_vit_b_16_pretrain_flickr8k.yaml)
[run_clip_vit_l_14_pretrain_flickr8k.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/clip/run_clip_vit_l_14_pretrain_flickr8k.yaml)
[run_clip_vit_l_14@336_pretrain_flickr8k.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/clip/run_clip_vit_l_14@336_pretrain_flickr8k.yaml) | | [blip2_vit_g](model_cards/blip2.md) | -
flickr30k
- | -
ITM
- | -
-
- | [run_blip2_vit_g_qformer_pretrain.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/blip2/run_blip2_vit_g_qformer_pretrain.yaml)
[run_blip2_vit_g_retrieval_flickr30k.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/blip2/run_blip2_vit_g_retrieval_flickr30k.yaml)
[run_blip2_vit_g_zero_shot_image_classification_cifar100.yaml](https://gitee.com/mindspore/mindformers/blob/dev/configs/blip2/run_blip2_vit_g_zero_shot_image_classification_cifar100.yaml) | ## 模型能力支持度 ### 核心关键模型能力一览表 | 关键模型 | 并行模式 | 数据并行 | 优化器并行 | 模型并行 | 流水并行 | 多副本并行 | 预训练 | 微调 | 评估 | 推理 | | :------: | :------------------------------: | :------: | :--------: | :------: | :------: | :--------: | ------ | :----------------: | :------------: | ---: | | Bloom | data_parallel\semi_auto_parallel | 是 | 是 | 是 | 是 | 是 | 是 | 全参微调 | 不支持 | 推理 | | GLM | data_parallel\semi_auto_parallel | 是 | 是 | 是 | 是 | 是 | 是 | 全参微调,Lora微调 | Blue/Rouge评估 | 推理 | | GLM2 | data_parallel\semi_auto_parallel | 是 | 是 | 是 | 是 | 是 | 是 | 全参微调,Lora微调 | Blue/Rouge评估 | 推理 | | GPT | data_parallel\semi_auto_parallel | 是 | 是 | 是 | 是 | 是 | 是 | 全参微调 | PPL评估 | 推理 | | LLaMa | data_parallel\semi_auto_parallel | 是 | 是 | 是 | 是 | 是 | 是 | 全参微调,Lora微调 | PPL评估 | 推理 | | LLaMa2 | data_parallel\semi_auto_parallel | 是 | 是 | 是 | 是 | 是 | 是 | 全参微调 | PPL评估 | 推理 | | PanGu | data_parallel\semi_auto_parallel | 是 | 是 | 是 | 是 | 是 | 是 | 全参微调 | PPL评估 | 推理 | ### Research模型支持情况一览表 | 模型 | 任务(task name) | 模型(model name) | | :--------------------------------------------: | :----------------------------------------------------------------------------------------------------: | :------------------------------ | | [Baichuan](../research/baichuan/baichuan.md) | [text_generation](https://gitee.com/mindspore/mindformers/blob/dev/docs/task_cards/text_generation.md) | baichuan_7b
baichuan_13b | | [Baichuan2](../research/baichuan2/baichuan2.md) | [text_generation](https://gitee.com/mindspore/mindformers/blob/dev/docs/task_cards/text_generation.md) | baichuan2_7b
baichuan2_13b | | [Internlm](../research/internlm/internlm.md) | [text_generation](https://gitee.com/mindspore/mindformers/blob/dev/docs/task_cards/text_generation.md) | InternLM-7B | | [ziya](../research/ziya/ziya.md) | [text_generation](https://gitee.com/mindspore/mindformers/blob/dev/docs/task_cards/text_generation.md) | ziya-13B | ### Text Generator支持度表 | model | 模型文档链接 | 增量推理 | 流式推理 | | :---------: | :-------------------------------------------: | :------: | :------: | | bloom | [link](../model_cards/bloom.md) | √ | √ | | GLM | [link](../model_cards/glm.md) | √ | √ | | GLM2 | [link](../model_cards/glm2.md) | √ | √ | | GPT | [link](../model_cards/gpt2.md) | √ | √ | | llama | [link](../model_cards/llama.md) | √ | √ | | llama2 | [link](../model_cards/llama.md) | √ | √ | | pangu-alpha | [link](../model_cards/pangualpha.md) | √ | √ | | T5 | [link](../model_cards/t5.md) | × | √ | | research | research | research | research | | baichuan | [link](../../research/baichuan/baichuan.md) | √ | √ | | baichuan2 | [link](../../research/baichuan2/baichuan2.md) | √ | √ | | internlm | [link](../../research/internlm/internlm.md) | √ | √ | | ziya | [link](../../research/ziya/ziya.md) | √ | √ | ### 边训练边评估支持度表 | 模型 | 评估指标 | 可用Model.eval完成评估 | 是否支持 | 数据并行模式 | 半自动并行模式 | | ----------- | :-------------------: | :--------------------: | -------- | ------------ | :---------------: | | bert | - | - | - | - | - | | blip2 | - | - | - | - | - | | bloom | - | - | - | - | - | | clip | - | - | - | - | - | | filip | - | - | - | - | - | | glm | Rouge,Bleu | 否 | 否 | × | × | | gpt2 | PPL | 是 | 是 | √ | √ | | llama | PPL | 是 | 是 | √ | √(7b 至少8卡) | | llama2 | PPL | 是 | 是 | √ | √(7b 至少8卡) | | MAE | 暂缺 | - | - | - | - | | pangu alpha | PPL | 是 | 是 | √ | √ | | qa-bert | f1, precision, recall | 是 | 是 | √ | × | | swin | Accuracy | 是 | 是 | √ | × | | t5 | 暂缺 | - | - | - | - | | tokcls-bert | f1, precision, recall | 是 | 是 | √ | × | | txtcls-bert | Accuracy | 是 | 是 | √ | × | | vit | Accuracy | 是 | 是 | √ | × | | research | research | research | research | research | research | | baichuan | PPL | 是 | 是 | √ | √(7b 至少8卡) | | baichuan2 | PPL | 是 | 是 | √ | √(7b 至少8卡) | | internlm | PPL | 是 | 是 | √ | √(7b 至少8卡) | | ziya | PPL | 是 | 是 | √ | √(13b 至少16卡) | ### 微调支持列表 | 模型 | 微调算法 | 运行模式 | | :------------------------------: | :------: | :---------------------: | | [GPT2](../model_cards/gpt2.md) | Lora | finetune、eval、predict | | [LLama](../model_cards/llama.md) | Lora | finetune、eval、predict | | [GLM](../model_cards/glm.md) | Lora | finetune、eval、predict | | [GLM2](../model_cards/glm2.md) | Lora | finetune、eval、predict | ### Chat Web支持列表 | 模型 | 规格 | 分词器 | 增量推理 | | ----- | ------------- | ------------- | -------- | | GLM | glm_6b | glm_6b | 支持 | | GLM2 | glm2_6b | glm2_6b | 支持 | | BLOOM | bloom_7.1b | bloom_7.1b | 支持 | | LLAMA | llama_7b_lora | llama_7b_lora | 支持 | ### 其余库上模型分布式支持情况一览表 | 模型 | 并行模式 | 数据并行 | 优化器并行 | 模型并行 | 流水并行 | 多副本并行 | | ----- | ------------- | -------- | ---------- | -------- | -------- | ---------- | | Bert | data_parallel | 是 | 是 | 否 | 否 | 否 | | BLIP2 | data_parallel | 是 | 是 | 否 | 否 | 否 | | CLIP | data_parallel | 是 | 是 | 否 | 否 | 否 | | MAE | data_parallel | 是 | 是 | 否 | 否 | 否 | | Swin | data_parallel | 是 | 是 | 否 | 否 | 否 | | T5 | data_parallel | 是 | 是 | 否 | 否 | 否 | | VIT | data_parallel | 是 | 是 | 否 | 否 | 否 |