Paraphrase Identification

72 papers with code • 10 benchmarks • 17 datasets

The goal of Paraphrase Identification is to determine whether a pair of sentences have the same meaning.

Source: Adversarial Examples with Difficult Common Words for Paraphrase Identification

Image source: On Paraphrase Identification Corpora

Benchmarks

Add a Result

These leaderboards are used to track progress in Paraphrase Identification

Dataset	Best Model	Compare
Quora Question Pairs	ALICE	See all
MSRP	FEAT2, TFKLD, SVM, Fine-grained features	See all
Quora Question Pairs Dev	BERT + SCH attm	See all
2017_test set	CNN	See all
WikiHop	StructBERTRoBERTa ensemble	See all
TURL	TSDAE	See all
PIT	TSDAE	See all
AP	RoBETRa base	See all
IMDb	SplitEE-S	See all
Yelp	SplitEE-S	See all

Libraries

Use these libraries to find Paraphrase Identification models and implementations

huggingface/transformers

3 papers

125,478

kaushaltrivedi/fast-bert

3 papers

1,847

utterworks/fast-bert

3 papers

1,847

labmlai/annotated_deep_learning_pap…

2 papers

48,540

See all 10 libraries.

Datasets

Latest papers

Most implemented Social Latest No code

Factorising Meaning and Form for Intent-Preserving Paraphrasing

tomhosking/separator • • ACL 2021

We propose a method for generating paraphrases of English questions that retain the original intent but use a different surface form.

31 May 2021

Paper
Code

FNet: Mixing Tokens with Fourier Transforms

labmlai/annotated_deep_learning_paper_implementations • • NAACL 2022

At longer input lengths, our FNet model is significantly faster: when compared to the "efficient" Transformers on the Long Range Arena benchmark, FNet matches the accuracy of the most accurate models, while outpacing the fastest models across all sequence lengths on GPUs (and across relatively shorter lengths on TPUs).

48,540

09 May 2021

Paper
Code

Entailment as Few-Shot Learner

PaddlePaddle/PaddleNLP • • 29 Apr 2021

Large pre-trained language models (LMs) have demonstrated remarkable ability as few-shot learners.

11,453

29 Apr 2021

Paper
Code

TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning

UKPLab/sentence-transformers • • 14 Apr 2021

Learning sentence embeddings often requires a large amount of labeled data.

13,848

14 Apr 2021

Paper
Code

Explaining Neural Network Predictions on Sentence Pairs via Learning Word-Group Masks

UVa-NLP/GMASK • • NAACL 2021

Most existing methods generate post-hoc explanations for neural network models by identifying individual feature attributions or detecting interactions between adjacent features.

09 Apr 2021

Paper
Code

Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning

rabeehk/compacter • • ACL 2021

Although pretrained language models can be fine-tuned to produce state-of-the-art results for a very wide range of language understanding tasks, the dynamics of this process are not well understood, especially in the low data regime.

122

22 Dec 2020

Paper
Code

RealFormer: Transformer Likes Residual Attention

google-research/google-research • • Findings (ACL) 2021

Transformer is the backbone of modern NLP models.

32,890

21 Dec 2020

Paper
Code

Self-Explaining Structures Improve NLP Models

ShannonAI/Self_Explaining_Structures_Improve_NLP_Models • • 3 Dec 2020

The proposed model comes with the following merits: (1) span weights make the model self-explainable and do not require an additional probing model for interpretation; (2) the proposed model is general and can be adapted to any existing deep learning structures in NLP; (3) the weight associated with each text span provides direct importance scores for higher-level text units such as phrases and sentences.

03 Dec 2020

Paper
Code