mindformers.models.BaseModel¶
-
class
mindformers.models.BaseModel(config: mindformers.models.base_config.BaseConfig, **kwargs)[源代码]¶ The base model that contains the class method from_pretained and save_pretrained, any new model that should inherit the class.
注解
GeneratorMixin provides the method generate that enable the generation for nlp models.
- 参数
config (BaseConfig) – The model configuration that inherits the BaseConfig.
-
classmethod
from_pretrained(pretrained_model_name_or_dir: str, **kwargs)[源代码]¶ Instantiates a model by the pretrained_model_name_or_dir. It download the model weights if the user pass a model name, or load the weight from the given directory if given the path. (only support standalone mode, and distribute mode waits for developing!)
- 参数
pretrained_model_name_or_dir (str) – It supports the following two input types. If pretrained_model_name_or_dir is a supported model name, for example, vit_base_p16 and t5_small, it will download the necessary files from the cloud. User can pass one from the support list by call MindFormerBook.get_model_support_list(). If pretrained_model_name_or_dir is a path to the local directory where there should have model weights ended with .ckpt and configuration file ended with yaml.
pretrained_model_name_or_path (Optional[str]) – Equal to “pretrained_model_name_or_dir”, if “pretrained_model_name_or_path” is set, “pretrained_model_name_or_dir” is useless.
实际案例
>>> from mindformers import T5ForConditionalGeneration >>> net = T5ForConditionalGeneration.from_pretrained('t5_small')
- 返回
A model, which inherited from BaseModel.
-
load_checkpoint(config)[源代码]¶ load checkpoint for models. (only support standalone mode, and distribute mode waits for developing)
- 参数
config (ModelConfig) – a model config instance, which could have attribute
"checkpoint_name_or_path (str) –
name or a path to checkpoint, to load model weights. (model) –
-
save_pretrained(save_directory: Optional[str] = None, save_name: str = 'mindspore_model')[源代码]¶ Save the model weight and configuration file. (only supports standalone mode, and distribute mode waits for developing)
- 参数
save_directory (str) – a directory to save the model weight and configuration. If None, the directory will be ./checkpoint_save, which can be obtained by the MindFormerBook.get_default_checkpoint_save_folder(). If set, the directory will be what is set.
save_name (str) – the name of saved files, including model weight and configuration file. Default mindspore_model.
实际案例
>>> import os >>> from mindformers import T5ForConditionalGeneration, MindFormerBook >>> net = T5ForConditionalGeneration.from_pretrained('t5_small') >>> net.save_pretrained() >>> output_path = MindFormerBook.get_default_checkpoint_save_folder() >>> print(os.listdir(output_path)) ['mindspore_model.yaml', 'mindspore_model.ckpt']