Transformers 文档

MobileBERT

Transformers

MobileBERT

概述

MobileBERT模型由Zhiqing Sun、Hongkun Yu、Xiaodan Song、Renjie Liu、Yiming Yang和Denny Zhou在MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices中提出。它是一个基于BERT模型的双向变压器，通过多种方法进行了压缩和加速。

论文的摘要如下：

自然语言处理（NLP）最近通过使用具有数亿参数的巨大预训练模型取得了巨大成功。然而，这些模型存在模型大小过大和延迟高的问题，因此无法部署到资源有限的移动设备上。在本文中，我们提出了MobileBERT，用于压缩和加速流行的BERT模型。与原始的BERT一样，MobileBERT是任务无关的，即它可以通过简单的微调广泛应用于各种下游NLP任务。基本上，MobileBERT是BERT_LARGE的瘦身版本，同时配备了瓶颈结构，并在自注意力和前馈网络之间进行了精心设计的平衡。为了训练MobileBERT，我们首先训练了一个特别设计的教师模型，即一个包含倒置瓶颈的BERT_LARGE模型。然后，我们从该教师模型向MobileBERT进行知识转移。实证研究表明，MobileBERT比BERT_BASE小4.3倍，快5.5倍，同时在知名基准测试中取得了有竞争力的结果。在GLUE的自然语言推理任务中，MobileBERT获得了77.7的GLUE分数（比BERT_BASE低0.6），在Pixel 4手机上的延迟为62毫秒。在SQuAD v1.1/v2.0问答任务中，MobileBERT的开发F1分数为90.0/79.2（比BERT_BASE高1.5/2.1）。

该模型由vshampor贡献。原始代码可以在这里找到。

使用提示

MobileBERT 是一个具有绝对位置嵌入的模型，因此通常建议在右侧而不是左侧填充输入。
MobileBERT 类似于 BERT，因此依赖于掩码语言建模（MLM）目标。因此，它在预测掩码标记和一般自然语言理解（NLU）方面非常高效，但在文本生成方面并不最优。使用因果语言建模（CLM）目标训练的模型在这方面表现更好。

Transformers

MobileBERT

概述

使用提示

资源

MobileBertConfig

类 transformers.MobileBertConfig

MobileBertTokenizer

类 transformers.MobileBertTokenizer

build_inputs_with_special_tokens

convert_tokens_to_string

create_token_type_ids_from_sequences

get_special_tokens_mask

MobileBertTokenizerFast

类 transformers.MobileBertTokenizerFast

build_inputs_with_special_tokens

create_token_type_ids_from_sequences

MobileBert 特定输出

类 transformers.models.mobilebert.modeling_mobilebert.MobileBertForPreTrainingOutput

类 transformers.models.mobilebert.modeling_tf_mobilebert.TFMobileBertForPreTrainingOutput

MobileBertModel

类 transformers.MobileBertModel

前进

MobileBertForPreTraining

类 transformers.MobileBertForPreTraining

前进

MobileBertForMaskedLM

类 transformers.MobileBertForMaskedLM

前进

MobileBertForNextSentencePrediction

类 transformers.MobileBertForNextSentencePrediction

前进

MobileBertForSequenceClassification

类 transformers.MobileBertForSequenceClassification

前进

MobileBertForMultipleChoice

类 transformers.MobileBertForMultipleChoice

前进

MobileBertForTokenClassification

类 transformers.MobileBertForTokenClassification

前进

MobileBertForQuestionAnswering

类 transformers.MobileBertForQuestionAnswering

前进

TFMobileBertModel

类 transformers.TFMobileBertModel

调用

TFMobileBertForPreTraining

类 transformers.TFMobileBertForPreTraining

调用

TFMobileBertForMaskedLM

类 transformers.TFMobileBertForMaskedLM

调用

TFMobileBertForNextSentencePrediction

类 transformers.TFMobileBertForNextSentencePrediction

调用

TFMobileBertForSequenceClassification

类 transformers.TFMobileBertForSequenceClassification

调用

TFMobileBertForMultipleChoice

类 transformers.TFMobileBertForMultipleChoice

调用

TFMobileBertForTokenClassification

类 transformers.TFMobileBertForTokenClassification

调用

TFMobileBertForQuestionAnswering

类 transformers.TFMobileBertForQuestionAnswering

调用