TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Natural Language Inference	ANLI test	T5-3B (explanation prompting)	A1	81.8	# 1
Natural Language Inference	ANLI test	T5-3B (explanation prompting)	A2	72.5	# 1
Natural Language Inference	ANLI test	T5-3B (explanation prompting)	A3	74.8	# 1
Natural Language Inference	ANLI test	T0-11B (explanation prompting)	A1	75.6	# 2
Natural Language Inference	ANLI test	T0-11B (explanation prompting)	A2	60.6	# 7
Natural Language Inference	ANLI test	T0-11B (explanation prompting)	A3	59.9	# 8

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/prompting-for-explanations-improves/natural-language-inference-on-anli-test)](https://paperswithcode.com/sota/natural-language-inference-on-anli-test?p=prompting-for-explanations-improves)`

Prompting for explanations improves Adversarial NLI. Is this true? {Yes} it is {true} because {it weakens superficial cues}

EACL 2023 · Pride Kavumba, Ana Brassard, Benjamin Heinzerling, Kentaro Inui ·

Explanation prompts ask language models to not only assign a particular label to a giveninput, such as true, entailment, or contradiction in the case of natural language inference but also to generate a free-text explanation that supports this label. For example: “This is label because explanation.” While this type of prompt was originally introduced with the aim of improving model interpretability, we showhere that explanation prompts also improve robustness to adversarial perturbations in naturallanguage inference benchmarks. Compared to prompting for labels only, explanation prompting consistently yields stronger performance on adversarial benchmarks, outperforming the state of the art on Adversarial Natural Language Inference, Counterfactually-Augmented Natural Language Inference, and SNLI-Hard datasets. We argue that the increase in robustness is due to the fact that prompting for explanations weakens superficial cues. Specifically, single tokens that are highly predictive of the correct answer in the label-only setting become uninformative when the model also has to generate explanations.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Natural Language Inference

Datasets

SNLI

BookCorpus

ANLI

e-SNLI

Results from the Paper

Add Remove

Ranked #1 on Natural Language Inference on ANLI test

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Natural Language Inference	ANLI test	T5-3B (explanation prompting)	A1	81.8	# 1	Compare
			A2	72.5	# 1	Compare
			A3	74.8	# 1	Compare
Natural Language Inference	ANLI test	T0-11B (explanation prompting)	A1	75.6	# 2	Compare
			A2	60.6	# 7	Compare
			A3	59.9	# 8	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Prompting for explanations improves Adversarial NLI. Is this true? {Yes} it is {true} because {it weakens superficial cues}

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove