Natural Language Inference

730 papers with code • 43 benchmarks • 77 datasets

Natural language inference (NLI) is the task of determining whether a "hypothesis" is true (entailment), false (contradiction), or undetermined (neutral) given a "premise".

Example:

Premise	Label	Hypothesis
A man inspects the uniform of a figure in some East Asian country.	contradiction	The man is sleeping.
An older and younger man smiling.	neutral	Two men are smiling and laughing at the cats playing on the floor.
A soccer game with multiple males playing.	entailment	Some men are playing a sport.
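
As a minimal sketch of how the three-way labelling task above might be represented and scored, the snippet below encodes the three example pairs and computes accuracy for a trivial baseline. The names `NLIExample`, `accuracy`, and `always_neutral` are hypothetical helpers introduced only for illustration; they do not come from any benchmark's official tooling.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# The three labels used in the example table above.
LABELS = ("entailment", "neutral", "contradiction")


@dataclass(frozen=True)
class NLIExample:
    """One premise/hypothesis pair with its gold label (hypothetical container)."""
    premise: str
    hypothesis: str
    label: str  # expected to be one of LABELS


# The three rows of the example table, copied verbatim.
EXAMPLES = [
    NLIExample(
        "A man inspects the uniform of a figure in some East Asian country.",
        "The man is sleeping.",
        "contradiction",
    ),
    NLIExample(
        "An older and younger man smiling.",
        "Two men are smiling and laughing at the cats playing on the floor.",
        "neutral",
    ),
    NLIExample(
        "A soccer game with multiple males playing.",
        "Some men are playing a sport.",
        "entailment",
    ),
]


def accuracy(
    predict: Callable[[str, str], str], examples: Sequence[NLIExample]
) -> float:
    """Fraction of examples where predict(premise, hypothesis) equals the gold label."""
    if not examples:
        raise ValueError("need at least one example")
    correct = sum(predict(ex.premise, ex.hypothesis) == ex.label for ex in examples)
    return correct / len(examples)


def always_neutral(premise: str, hypothesis: str) -> str:
    """Trivial baseline (an assumption for illustration): ignore the input."""
    return "neutral"


if __name__ == "__main__":
    print(f"always-neutral accuracy: {accuracy(always_neutral, EXAMPLES):.2f}")
```

Any real model can be dropped in for `always_neutral` as long as it maps a premise and a hypothesis to one of the three label strings.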

Approaches to NLI range from earlier symbolic and statistical methods to more recent deep learning methods. Benchmark datasets for NLI include SNLI, MultiNLI, and SciTail, among others. You can get hands-on practice on the SNLI task by following the corresponding d2l.ai chapter.

Benchmarks


These leaderboards track progress in natural language inference.

Dataset	Best Model
SNLI	RoBERTa-large 355M + Entailment as Few-shot Learner
RTE	Vega v2 6B (KD-based prompt transfer)
MultiNLI	Turing NLR v5 XXL 5.4B (fine-tuned)
QNLI	ALBERT
ANLI test	T5-3B (explanation prompting)
WNLI	Turing NLR v5 XXL 5.4B (fine-tuned)
CommitmentBank	PaLM 540B (finetuned)
SciTail	CA-MTL
MultiNLI Dev	TinyBERT-6 67M
FarsTail	mBERT
MedNLI	SciFive-large
TERRa	Human Benchmark
LiDiRus	Human Benchmark
RCB	Human Benchmark
XNLI French	FlauBERT (large)
V-SNLI	V-BiMPM
XNLI Chinese Dev	ERNIE 2.0 Base
XNLI Chinese	ERNIE 2.0 Large
Quora Question Pairs	aESIM
SICK	NeuralLog
MED	NeuralLog
KUAKE-QQR	BERT-base
KUAKE-QTR	MacBERT-large
XWINO	mGPT
MRPC	DeBERTaV3large
HANS	Roberta-large
BioNLI	BioLinkBert
AX	T5
MNLI + SNLI + ANLI + FEVER	SMARTRoBERTa-LARGE
e-SNLI	ExplainThenPredictAttention (e-InferSent Bi-LSTM + Attention)
Probability words NLI	roberta-base-mnli


Libraries

Use these libraries to find Natural Language Inference models and implementations

Library	Papers	Stars
huggingface/transformers	14	124,984
namisan/mt-dnn	5	2,199
dmlc/gluon-nlp	4	2,548
mynlp/ccg2lambda	4	229

See all 17 libraries.


Latest papers with no code


Automated Long Answer Grading with RiceChem Dataset

no code yet • 22 Apr 2024

With this work, we offer a fresh perspective on grading long, fact-based answers and introduce a new dataset to stimulate further research in this important area.


Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits

no code yet • 22 Apr 2024

We subsequently train language models to identify entailment, contradiction, and neutrality from student response, akin to NLI, and with the added dimension of identifying omissions from gold answers.


Explanation based Bias Decoupling Regularization for Natural Language Inference

no code yet • 20 Apr 2024

The robustness of Transformer-based Natural Language Inference encoders is frequently compromised as they tend to rely more on dataset biases than on the intended task-relevant features.


How often are errors in natural language reasoning due to paraphrastic variability?

no code yet • 17 Apr 2024

We propose a metric for evaluating the paraphrastic consistency of natural language reasoning models based on the probability of a model achieving the same correctness on two paraphrases of the same problem.


DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

no code yet • 14 Apr 2024

Safe and reliable natural language inference is critical for extracting insights from clinical trial reports but poses challenges due to biases in large pre-trained language models.


MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference

no code yet • 11 Apr 2024

Furthermore, we show that domain shift degrades the performance of scientific NLI models which demonstrates the diverse characteristics of different domains in our dataset.


SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials

no code yet • 7 Apr 2024

Addressing this, we present SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials.


A Morphology-Based Investigation of Positional Encodings

no code yet • 6 Apr 2024

How does the importance of positional encoding in pre-trained language models (PLMs) vary across languages with different morphological complexity?


SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials

no code yet • 5 Apr 2024

This paper describes our submission to Task 2 of SemEval-2024: Safe Biomedical Natural Language Inference for Clinical Trials.


A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference

no code yet • 3 Apr 2024

Integer Linear Programming (ILP) has been proposed as a formalism for encoding precise structural and semantic constraints for Natural Language Inference (NLI).

