Autoencoding Transformers

DeeBERT is a method for accelerating BERT inference. It inserts extra classification layers, referred to as off-ramps, after each transformer layer of BERT. All transformer layers and off-ramps are jointly fine-tuned on a given downstream dataset. At inference time, after a sample passes through a transformer layer, it is fed to the following off-ramp. If the off-ramp is confident in its prediction, the result is returned; otherwise, the sample is sent on to the next transformer layer.

Source: DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
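The early-exit loop described above can be sketched in plain Python. The scalar "layers" and off-ramps below are illustrative stand-ins, not real BERT modules, and the entropy threshold value is an assumption; the paper does use the entropy of the off-ramp's output distribution as the confidence measure.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def entropy(probs):
    # Shannon entropy of a probability distribution; lower = more confident.
    return -sum(p * math.log(p) for p in probs if p > 0)

def deebert_inference(hidden, layers, off_ramps, threshold):
    """Early-exit inference sketch: after each transformer layer, the
    paired off-ramp classifies the current hidden state; if the
    prediction entropy falls below the threshold, return early.
    Returns (predicted class, number of layers actually used)."""
    for i, (layer, ramp) in enumerate(zip(layers, off_ramps)):
        hidden = layer(hidden)
        probs = softmax(ramp(hidden))
        if entropy(probs) < threshold:
            return max(range(len(probs)), key=probs.__getitem__), i + 1
    # No off-ramp was confident enough: return the last prediction.
    return max(range(len(probs)), key=probs.__getitem__), len(layers)

# Toy demo (hypothetical): each "layer" increments a scalar hidden state,
# and the off-ramp's logit for class 0 grows with it, so confidence
# increases with depth and a later ramp clears the threshold.
layers = [lambda h: h + 1.0] * 6
ramps = [lambda h: [h, 0.0]] * 6
pred, exit_layer = deebert_inference(0.0, layers, ramps, threshold=0.25)
```

In this toy run the entropy drops with each layer, so the sample exits before reaching the final layer; easier samples exit earlier, which is where the inference speedup comes from.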



