XLNet

Introduced by Yang et al. in XLNet: Generalized Autoregressive Pretraining for Language Understanding

XLNet is an autoregressive Transformer that leverages the best of both autoregressive language modeling and autoencoding while attempting to avoid their limitations. Instead of using a fixed forward or backward factorization order as in conventional autoregressive models, XLNet maximizes the expected log likelihood of a sequence w.r.t. all possible permutations of the factorization order. Thanks to the permutation operation, the context for each position can consist of tokens from both left and right. In expectation, each position learns to utilize contextual information from all positions, i.e., capturing bidirectional context.

Additionally, inspired by the latest advancements in autogressive language modeling, XLNet integrates the segment recurrence mechanism and relative encoding scheme of Transformer-XL into pretraining, which empirically improves the performance especially for tasks involving a longer text sequence.

Source: XLNet: Generalized Autoregressive Pretraining for Language Understanding

Read Paper See Code

Papers

Paper	Code	Results	Date	Stars

Tasks

Task	Papers	Share
Language Modelling	23	7.47%
Sentence	20	6.49%
Question Answering	17	5.52%
Sentiment Analysis	15	4.87%
Text Classification	11	3.57%
Reading Comprehension	11	3.57%
Natural Language Understanding	8	2.60%
General Classification	8	2.60%
Text Generation	7	2.27%

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
Adam	Stochastic Optimization
Dense Connections	Feedforward Networks
Dropout	Regularization
GELU	Activation Functions
Layer Normalization	Normalization
Linear Warmup With Linear Decay	Learning Rate Schedules
Multi-Head Attention	Attention Modules
Residual Connection	Skip Connections
Scaled Dot-Product Attention	Attention Mechanisms
SentencePiece	Tokenizers
Softmax	Output Functions

Categories

Add Remove

Transformers

Autoregressive Transformers