Natural Language Inference

730 papers with code • 34 benchmarks • 77 datasets

Natural language inference (NLI) is the task of determining whether a "hypothesis" is true (entailment), false (contradiction), or undetermined (neutral) given a "premise".

Example:

Premise	Label	Hypothesis
A man inspects the uniform of a figure in some East Asian country.	contradiction	The man is sleeping.
An older and younger man smiling.	neutral	Two men are smiling and laughing at the cats playing on the floor.
A soccer game with multiple males playing.	entailment	Some men are playing a sport.

Approaches used for NLI include earlier symbolic and statistical approaches to more recent deep learning approaches. Benchmark datasets used for NLI include SNLI, MultiNLI, SciTail, among others. You can get hands-on practice on the SNLI task by following this d2l.ai chapter.

Benchmarks

Add a Result

These leaderboards are used to track progress in Natural Language Inference

Dataset	Best Model	Compare
SNLI	RoBERTa-large 355M + Entailment as Few-shot Learner	See all
RTE	Vega v2 6B (KD-based prompt transfer)	See all
MultiNLI	Turing NLR v5 XXL 5.4B (fine-tuned)	See all
QNLI	ALBERT	See all
ANLI test	T5-3B (explanation prompting)	See all
WNLI	Turing NLR v5 XXL 5.4B (fine-tuned)	See all
CommitmentBank	PaLM 540B (finetuned)	See all
SciTail	CA-MTL	See all
MultiNLI Dev	TinyBERT-6 67M	See all
FarsTail	mBERT	See all
MedNLI	SciFive-large	See all
TERRa	Human Benchmark	See all
LiDiRus	Human Benchmark	See all
RCB	Human Benchmark	See all
XNLI French	FlauBERT (large)	See all
V-SNLI	V-BiMPM	See all
XNLI Chinese Dev	ERNIE 2.0 Base	See all
XNLI Chinese	ERNIE 2.0 Large	See all
Quora Question Pairs	aESIM	See all
SICK	NeuralLog	See all
MED	NeuralLog	See all
KUAKE-QQR	BERT-base	See all
KUAKE-QTR	MacBERT-large	See all
XWINO	mGPT	See all
MRPC	DeBERTaV3large	See all
HANS	Roberta-large	See all
BioNLI	BioLinkBert	See all
AX	T5	See all
MNLI + SNLI + ANLI + FEVER	SMARTRoBERTa-LARGE	See all
e-SNLI	ExplainThenPredictAttention (e-InferSent Bi-LSTM + Attention)	See all
Probability words NLI	roberta-base-mnli	See all

Show all 34 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Natural Language Inference models and implementations

huggingface/transformers

14 papers

125,059

namisan/mt-dnn

5 papers

2,200

dmlc/gluon-nlp

4 papers

2,548

mynlp/ccg2lambda

4 papers

229

See all 17 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Big Bird: Transformers for Longer Sequences

google-research/bigbird • • NeurIPS 2020

To remedy this, we propose, BigBird, a sparse attention mechanism that reduces this quadratic dependency to linear.

Paper
Code

A Decomposable Attention Model for Natural Language Inference

dmlc/gluon-nlp • • EMNLP 2016

We propose a simple neural architecture for natural language inference.

Paper
Code

Bilateral Multi-Perspective Matching for Natural Language Sentences

google-research-datasets/paws • 13 Feb 2017

Natural language sentence matching is a fundamental technology for a variety of tasks.

Paper
Code

XNLI: Evaluating Cross-lingual Sentence Representations

facebookresearch/XLM • • EMNLP 2018

State-of-the-art natural language processing systems rely on supervision in the form of annotated data to learn competent models.

Paper
Code

NEZHA: Neural Contextualized Representation for Chinese Language Understanding

PaddlePaddle/PaddleNLP • • 31 Aug 2019

The pre-trained language models have achieved great successes in various natural language understanding (NLU) tasks due to its capacity to capture the deep contextualized information in text by pre-training on large-scale corpora.

Paper
Code

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

microsoft/DeBERTa • • ICLR 2021

Recent progress in pre-trained neural language models has significantly improved the performance of many natural language processing (NLP) tasks.

Paper
Code

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

pytorch/fairseq • • Preprint 2022

While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind.

Paper
Code

ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs

yinwenpeng/Answer_Selection • TACL 2016

(ii) We propose three attention schemes that integrate mutual influence between sentences into CNN; thus, the representation of each sentence takes into consideration its counterpart.

Paper
Code

From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification

deep-spin/entmax • • 5 Feb 2016

We propose sparsemax, a new activation function similar to the traditional softmax, but able to output sparse probabilities.

Paper
Code

Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence

HSLCY/ABSA-BERT-pair • • NAACL 2019

Aspect-based sentiment analysis (ABSA), which aims to identify fine-grained opinion polarity towards a specific aspect, is a challenging subtask of sentiment analysis (SA).

Paper
Code

Natural Language Inference

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result