TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Linguistic Acceptability	CoLA	LM-CPPF RoBERTa-base	Accuracy	14.1%	# 42
Sentiment Analysis	CR	LM-CPPF RoBERTa-base	Accuracy	93.3	# 2
Natural Language Inference	MultiNLI	LM-CPPF RoBERTa-base	Accuracy	68.4	# 4
Natural Language Inference	QNLI	LM-CPPF RoBERTa-base	Accuracy	70.2%	# 41
Sentiment Analysis	SST-2 Binary classification	LM-CPPF RoBERTa-base	Accuracy	93.2	# 40
Sentiment Analysis	SST-5 Fine-grained classification	LM-CPPF RoBERTa-base	Accuracy	54.9	# 6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lm-cppf-paraphrasing-guided-data-augmentation/sentiment-analysis-on-cr)](https://paperswithcode.com/sota/sentiment-analysis-on-cr?p=lm-cppf-paraphrasing-guided-data-augmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lm-cppf-paraphrasing-guided-data-augmentation/natural-language-inference-on-multinli)](https://paperswithcode.com/sota/natural-language-inference-on-multinli?p=lm-cppf-paraphrasing-guided-data-augmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lm-cppf-paraphrasing-guided-data-augmentation/sentiment-analysis-on-sst-5-fine-grained)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-5-fine-grained?p=lm-cppf-paraphrasing-guided-data-augmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lm-cppf-paraphrasing-guided-data-augmentation/sentiment-analysis-on-sst-2-binary)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-2-binary?p=lm-cppf-paraphrasing-guided-data-augmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lm-cppf-paraphrasing-guided-data-augmentation/natural-language-inference-on-qnli)](https://paperswithcode.com/sota/natural-language-inference-on-qnli?p=lm-cppf-paraphrasing-guided-data-augmentation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/lm-cppf-paraphrasing-guided-data-augmentation/linguistic-acceptability-on-cola)](https://paperswithcode.com/sota/linguistic-acceptability-on-cola?p=lm-cppf-paraphrasing-guided-data-augmentation)`

LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning

29 May 2023 · Amirhossein Abaskohi, Sascha Rothe, Yadollah Yaghoobzadeh ·

In recent years, there has been significant progress in developing pre-trained language models for NLP. However, these models often struggle when fine-tuned on small datasets. To address this issue, researchers have proposed various adaptation approaches. Prompt-based tuning is arguably the most common way, especially for larger models. Previous research shows that adding contrastive learning to prompt-based fine-tuning is effective as it helps the model generate embeddings that are more distinguishable between classes, and it can also be more sample-efficient as the model learns from positive and negative examples simultaneously. One of the most important components of contrastive learning is data augmentation, but unlike computer vision, effective data augmentation for NLP is still challenging. This paper proposes LM-CPPF, Contrastive Paraphrasing-guided Prompt-based Fine-tuning of Language Models, which leverages prompt-based few-shot paraphrasing using generative language models, especially large language models such as GPT-3 and OPT-175B, for data augmentation. Our experiments on multiple text classification benchmarks show that this augmentation method outperforms other methods, such as easy data augmentation, back translation, and multiple templates.

PDF Abstract

Code

Add Remove Mark official

amirabaskohi/lm-cppf official

Tasks

Add Remove

Contrastive Learning

Data Augmentation

Linguistic Acceptability

Natural Language Inference

Sentiment Analysis

text-classification

Text Classification

Datasets

GLUE

SST

MultiNLI SST-2

QNLI

CoLA SST-5

PARANMT-50M

Results from the Paper

Edit

Ranked #2 on Sentiment Analysis on CR

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Linguistic Acceptability	CoLA	LM-CPPF RoBERTa-base	Accuracy	14.1%	# 42	Compare
Sentiment Analysis	CR	LM-CPPF RoBERTa-base	Accuracy	93.3	# 2	Compare
Natural Language Inference	MultiNLI	LM-CPPF RoBERTa-base	Accuracy	68.4	# 4	Compare
Natural Language Inference	QNLI	LM-CPPF RoBERTa-base	Accuracy	70.2%	# 41	Compare
Sentiment Analysis	SST-2 Binary classification	LM-CPPF RoBERTa-base	Accuracy	93.2	# 40	Compare
Sentiment Analysis	SST-5 Fine-grained classification	LM-CPPF RoBERTa-base	Accuracy	54.9	# 6	Compare

Methods

Add Remove

Adam • Attention Dropout • BPE • Contrastive Learning • Cosine Annealing • Dense Connections • Dropout • Fixed Factorized Attention • GELU • GPT-3 • Layer Normalization • Linear Layer • Linear Warmup With Cosine Annealing • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Strided Attention • Weight Decay

Edit Social Preview

LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove