Language Model Pre-Training

MPNet is a pre-training method for language models that combines masked language modeling (MLM) and permuted language modeling (PLM) in a unified view. Through permuted language modeling it models the dependency among the predicted tokens, avoiding the independence assumption that BERT's MLM makes among masked tokens. At the same time, it takes the position information of the full sentence as input, so the model sees the positions of all tokens, which alleviates the position discrepancy of XLNet (whose permuted language modeling does not see the full position information of a sentence during pre-training).

The training objective of MPNet is:

$$ \mathbb{E}_{z\in\mathcal{Z}_{n}} \sum_{t=c+1}^{n}\log P\left(x_{z_{t}}\mid x_{z_{<t}}, M_{z_{>c}}; \theta\right) $$

As can be seen, MPNet conditions on $x_{z_{<t}}$ (the tokens preceding the current predicted token $x_{z_{t}}$) rather than only on the non-predicted tokens $x_{z_{\leq c}}$ as in MLM; compared with PLM, MPNet takes more information as input (namely, the mask symbols $[M]$ at the positions $z_{>c}$). Although the objective looks simple, implementing the model efficiently is non-trivial; see the paper for details.
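
For comparison, the paper's unified view writes the MLM and PLM objectives in the same notation, which makes the two differences above explicit:

$$ \text{MLM: } \mathbb{E}_{z\in\mathcal{Z}_{n}} \sum_{t=c+1}^{n}\log P\left(x_{z_{t}}\mid x_{z_{\leq c}}, M_{z_{>c}}; \theta\right), \qquad \text{PLM: } \mathbb{E}_{z\in\mathcal{Z}_{n}} \sum_{t=c+1}^{n}\log P\left(x_{z_{t}}\mid x_{z_{<t}}; \theta\right) $$

The sketch below illustrates the conditioning set implied by the objective: sample a permutation $z$, keep the first $c$ tokens as non-predicted context, and for each prediction step $t > c$ condition on the preceding permuted tokens $x_{z_{<t}}$ plus mask symbols carrying the positions $z_{>c}$. This is a minimal illustration of the data construction only (the function name `mpnet_conditioning`, the toy tokens, and the plain-Python representation are assumptions for this example), not the paper's actual two-stream attention implementation.

```python
import random

MASK = "[M]"

def mpnet_conditioning(tokens, c, seed=0):
    """Illustrative sketch (not the official implementation).

    tokens : input tokens x_1 .. x_n
    c      : number of non-predicted tokens; the last n - c tokens in the
             permuted order are predicted.
    Returns one (target, content_inputs, mask_inputs) triple per prediction
    step, mirroring P(x_{z_t} | x_{z_<t}, M_{z_>c}).
    """
    n = len(tokens)
    z = list(range(n))
    random.Random(seed).shuffle(z)          # a permutation z of the positions

    steps = []
    for t in range(c, n):                   # prediction steps t = c+1 .. n
        target = (z[t], tokens[z[t]])
        # x_{z_<t}: all tokens that precede step t in the permuted order,
        # including previously predicted ones (the PLM-style dependency)
        content = [(z[s], tokens[z[s]]) for s in range(t)]
        # M_{z_>c}: mask symbols carrying the positions of every predicted
        # token, so the model sees full position information
        masks = [(z[s], MASK) for s in range(c, n)]
        steps.append((target, content, masks))
    return steps

if __name__ == "__main__":
    toks = ["the", "task", "is", "sentence", "classification"]
    for target, content, masks in mpnet_conditioning(toks, c=3):
        print("predict", target, "| content:", content, "| masks:", masks)
```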

Source: MPNet: Masked and Permuted Pre-training for Language Understanding
