TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Linguistic Acceptability	CoLA	MT-DNN	Accuracy	68.4%	# 18
Natural Language Inference	MultiNLI	MT-DNN	Matched	86.7	# 22
Natural Language Inference	MultiNLI	MT-DNN	Mismatched	86.0	# 16
Paraphrase Identification	Quora Question Pairs	MT-DNN	Accuracy	89.6	# 8
Paraphrase Identification	Quora Question Pairs	MT-DNN	F1	72.4	# 10
Natural Language Inference	SciTail	MT-DNN	Accuracy	94.1	# 2
Natural Language Inference	SNLI	MT-DNN	% Test Accuracy	91.6	# 8
Natural Language Inference	SNLI	MT-DNN	% Train Accuracy	97.2	# 4
Natural Language Inference	SNLI	MT-DNN	Parameters	330m	# 4
Natural Language Inference	SNLI	Ntumpha	% Test Accuracy	90.5	# 10
Natural Language Inference	SNLI	Ntumpha	% Train Accuracy	99.1	# 2
Natural Language Inference	SNLI	Ntumpha	Parameters	220	# 3
Sentiment Analysis	SST-2 Binary classification	MT-DNN	Accuracy	95.6	# 22

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-task-deep-neural-networks-for-natural/natural-language-inference-on-scitail)](https://paperswithcode.com/sota/natural-language-inference-on-scitail?p=multi-task-deep-neural-networks-for-natural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-task-deep-neural-networks-for-natural/natural-language-inference-on-snli)](https://paperswithcode.com/sota/natural-language-inference-on-snli?p=multi-task-deep-neural-networks-for-natural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-task-deep-neural-networks-for-natural/paraphrase-identification-on-quora-question)](https://paperswithcode.com/sota/paraphrase-identification-on-quora-question?p=multi-task-deep-neural-networks-for-natural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-task-deep-neural-networks-for-natural/linguistic-acceptability-on-cola)](https://paperswithcode.com/sota/linguistic-acceptability-on-cola?p=multi-task-deep-neural-networks-for-natural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-task-deep-neural-networks-for-natural/natural-language-inference-on-multinli)](https://paperswithcode.com/sota/natural-language-inference-on-multinli?p=multi-task-deep-neural-networks-for-natural)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-task-deep-neural-networks-for-natural/sentiment-analysis-on-sst-2-binary)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-2-binary?p=multi-task-deep-neural-networks-for-natural)`

Multi-Task Deep Neural Networks for Natural Language Understanding

ACL 2019 · Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao ·

In this paper, we present a Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks. MT-DNN not only leverages large amounts of cross-task data, but also benefits from a regularization effect that leads to more general representations in order to adapt to new tasks and domains. MT-DNN extends the model proposed in Liu et al. (2015) by incorporating a pre-trained bidirectional transformer language model, known as BERT (Devlin et al., 2018). MT-DNN obtains new state-of-the-art results on ten NLU tasks, including SNLI, SciTail, and eight out of nine GLUE tasks, pushing the GLUE benchmark to 82.7% (2.2% absolute improvement). We also demonstrate using the SNLI and SciTail datasets that the representations learned by MT-DNN allow domain adaptation with substantially fewer in-domain labels than the pre-trained BERT representations. The code and pre-trained models are publicly available at https://github.com/namisan/mt-dnn.

PDF Abstract ACL 2019 PDF ACL 2019 Abstract

Code

Add Remove Mark official

namisan/mt-dnn official

2,198

xycforgithub/MultiTask-MRC

100

ABaldrati/MT-BERT

phueb/CHILDES-SRL

phueb/BabyBertSRL

See all 7 implementations

Tasks

Add Remove

Domain Adaptation

Language Modelling

Linguistic Acceptability

Natural Language Inference

Natural Language Understanding

Paraphrase Identification

Sentiment Analysis

Datasets

GLUE

SST

MultiNLI SST-2

SNLI

QNLI

CoLA

Quora

Quora Question Pairs

SciTail

Results from the Paper

Edit

Ranked #2 on Natural Language Inference on SciTail

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Natural Language Inference	MultiNLI	MT-DNN	Matched	86.7	# 22	Compare
Natural Language Inference	MultiNLI	MT-DNN	Mismatched	86.0	# 16	Compare
Paraphrase Identification	Quora Question Pairs	MT-DNN	Accuracy	89.6	# 8	Compare
Paraphrase Identification	Quora Question Pairs	MT-DNN	F1	72.4	# 10	Compare
Natural Language Inference	SciTail	MT-DNN	Accuracy	94.1	# 2	Compare
Natural Language Inference	SNLI	MT-DNN	% Test Accuracy	91.6	# 8	Compare
			% Train Accuracy	97.2	# 4	Compare
			Parameters	330m	# 4	Compare
Natural Language Inference	SNLI	Ntumpha	% Test Accuracy	90.5	# 10	Compare
			% Train Accuracy	99.1	# 2	Compare
			Parameters	220	# 3	Compare
Sentiment Analysis	SST-2 Binary classification	MT-DNN	Accuracy	95.6	# 22	Compare

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank	Source Paper	Compare
Linguistic Acceptability	CoLA	MT-DNN	Accuracy	68.4%	# 18		See all

Methods

Add Remove

Absolute Position Encodings • Adam • Attention Dropout • BERT • BPE • Dense Connections • Dropout • GELU • Label Smoothing • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Multi-Head Attention • Position-Wise Feed-Forward Layer • ReLU • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer • Weight Decay • WordPiece

Edit Social Preview

Multi-Task Deep Neural Networks for Natural Language Understanding

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit