mindformers.models.bert.BertTokenizer

class mindformers.models.bert.BertTokenizer(vocab_file, do_lower_case=True, do_basic_tokenize=True, unk_token='[UNK]', sep_token='[SEP]', pad_token='[PAD]', cls_token='[CLS]', mask_token='[MASK]', is_tokenize_char=False, **kwargs)[source]

BERT tokenizer.

save_vocabulary(save_directory, filename_prefix)[source]

Writes the tokenizer's vocabulary to a file under save_directory.
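
As a rough illustration of what saving a vocabulary can look like, the sketch below writes a toy vocabulary in the common BERT text format: one token per line, ordered by token id. It does not call MindFormers. The helper name `save_vocabulary_sketch`, the output file name `<filename_prefix>-vocab.txt`, and the toy vocabulary are all assumptions made for this example, and the real method may name or lay out its file differently.

```python
import os
import tempfile


def save_vocabulary_sketch(vocab, save_directory, filename_prefix=None):
    """Write tokens to a text file, one per line, ordered by token id.

    Hypothetical stand-in, not the MindFormers implementation.
    """
    # Assumed naming scheme; the real method may choose another file name.
    name = f"{filename_prefix}-vocab.txt" if filename_prefix else "vocab.txt"
    os.makedirs(save_directory, exist_ok=True)
    path = os.path.join(save_directory, name)
    with open(path, "w", encoding="utf-8") as f:
        # Sort by id so that a token's line number matches its id.
        for token, _ in sorted(vocab.items(), key=lambda kv: kv[1]):
            f.write(token + "\n")
    return path


# Toy vocabulary (illustrative only).
toy_vocab = {"[PAD]": 0, "[UNK]": 1, "[CLS]": 2, "[SEP]": 3, "[MASK]": 4, "hello": 5}
out_dir = tempfile.mkdtemp()
saved_path = save_vocabulary_sketch(toy_vocab, out_dir, "demo")

# Read the file back to check the round trip.
with open(saved_path, encoding="utf-8") as f:
    saved_tokens = f.read().splitlines()
```

Writing tokens in id order means the file can be reloaded by assigning each token its line index, which is how BERT-style vocab.txt files are usually read.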

property vocab_size

Returns the vocabulary size.
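
Because vocab_size is declared as a property, it is read as an attribute, without parentheses. The sketch below shows that access pattern on a toy class. `ToyTokenizer` is hypothetical and is not the MindFormers class, and it assumes the size is simply the number of entries in the token-to-id mapping.

```python
class ToyTokenizer:
    """Hypothetical stand-in used only to show property access."""

    def __init__(self, vocab):
        # Token-to-id mapping (assumed internal representation).
        self.vocab = dict(vocab)

    @property
    def vocab_size(self):
        # Assumption: the size is the number of entries in the mapping.
        return len(self.vocab)


toy = ToyTokenizer({"[PAD]": 0, "[UNK]": 1, "hello": 2})
size = toy.vocab_size  # attribute access, not a method call
```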