mindformers.models.vit.ViTModel

class mindformers.models.vit.ViTModel(config=None)[源代码]

Vision Transformer with support for patch or hybrid CNN input stage. The supported model name could be selected from ViTConfig.show_support_list().

参数

config (ViTConfig) – the config of Vit model.

实际案例

>>> # input model name, load model and weights
>>> model_a = ViTModel.from_pretrained('vit_base_p16')
>>> # input config, load model without weights
>>> from mindformers import AutoConfig
>>> config = AutoConfig.from_pretrained('vit_base_p16')
>>> model_b = ViTModel(config)
construct_without_pool(image, mask=None)[源代码]

construct of vit without pool

fix_init_weight()[源代码]

fix init weight

init_weights_vit()[源代码]

init weights vit ViT weight initialization, original timm impl (for reproducibility)