mindformers.models.BaseModel

class mindformers.models.BaseModel(config: mindformers.models.base_config.BaseConfig, **kwargs)[源代码]

The base model that contains the class method from_pretained and save_pretrained, any new model that should inherit the class.

注解

GeneratorMixin provides the method generate that enable the generation for nlp models.

参数

config (BaseConfig) – The model configuration that inherits the BaseConfig.

classmethod from_pretrained(pretrained_model_name_or_dir: str, **kwargs)[源代码]

Instantiates a model by the pretrained_model_name_or_dir. It download the model weights if the user pass a model name, or load the weight from the given directory if given the path. (only support standalone mode, and distribute mode waits for developing!)

参数
  • pretrained_model_name_or_dir (str) – It supports the following two input types. If pretrained_model_name_or_dir is a supported model name, for example, vit_base_p16 and t5_small, it will download the necessary files from the cloud. User can pass one from the support list by call MindFormerBook.get_model_support_list(). If pretrained_model_name_or_dir is a path to the local directory where there should have model weights ended with .ckpt and configuration file ended with yaml.

  • pretrained_model_name_or_path (Optional[str]) – Equal to “pretrained_model_name_or_dir”, if “pretrained_model_name_or_path” is set, “pretrained_model_name_or_dir” is useless.

实际案例

>>> from mindformers import T5ForConditionalGeneration
>>> net = T5ForConditionalGeneration.from_pretrained('t5_small')
返回

A model, which inherited from BaseModel.

classmethod get_support_list()[源代码]

get_support_list method

load_checkpoint(config)[源代码]

load checkpoint for models. (only support standalone mode, and distribute mode waits for developing)

参数
  • config (ModelConfig) – a model config instance, which could have attribute

  • "checkpoint_name_or_path (str) –

  • name or a path to checkpoint, to load model weights. (model) –

remove_type(config)[源代码]

remove type caused by save’

save_pretrained(save_directory: Optional[str] = None, save_name: str = 'mindspore_model')[源代码]

Save the model weight and configuration file. (only supports standalone mode, and distribute mode waits for developing)

参数
  • save_directory (str) – a directory to save the model weight and configuration. If None, the directory will be ./checkpoint_save, which can be obtained by the MindFormerBook.get_default_checkpoint_save_folder(). If set, the directory will be what is set.

  • save_name (str) – the name of saved files, including model weight and configuration file. Default mindspore_model.

实际案例

>>> import os
>>> from mindformers import T5ForConditionalGeneration, MindFormerBook
>>> net = T5ForConditionalGeneration.from_pretrained('t5_small')
>>> net.save_pretrained()
>>> output_path = MindFormerBook.get_default_checkpoint_save_folder()
>>> print(os.listdir(output_path))
['mindspore_model.yaml', 'mindspore_model.ckpt']
classmethod show_support_list()[源代码]

show_support_list method