Paraphrase Identification

72 papers with code • 10 benchmarks • 17 datasets

The goal of Paraphrase Identification is to determine whether a pair of sentences have the same meaning.

Source: Adversarial Examples with Difficult Common Words for Paraphrase Identification

Image source: On Paraphrase Identification Corpora

Benchmarks

Add a Result

These leaderboards are used to track progress in Paraphrase Identification

Dataset	Best Model	Compare
Quora Question Pairs	ALICE	See all
MSRP	FEAT2, TFKLD, SVM, Fine-grained features	See all
Quora Question Pairs Dev	BERT + SCH attm	See all
2017_test set	CNN	See all
WikiHop	StructBERTRoBERTa ensemble	See all
TURL	TSDAE	See all
PIT	TSDAE	See all
AP	RoBETRa base	See all
IMDb	SplitEE-S	See all
Yelp	SplitEE-S	See all

Libraries

Use these libraries to find Paraphrase Identification models and implementations

huggingface/transformers

3 papers

124,984

kaushaltrivedi/fast-bert

3 papers

1,846

utterworks/fast-bert

3 papers

1,846

labmlai/annotated_deep_learning_pap…

2 papers

48,096

See all 10 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

google-research/bert • • NAACL 2019

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers.

528

Paper
Code

XLNet: Generalized Autoregressive Pretraining for Language Understanding

zihangdai/xlnet • • NeurIPS 2019

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling.

Paper
Code

FNet: Mixing Tokens with Fourier Transforms

google-research/google-research • • NAACL 2022

At longer input lengths, our FNet model is significantly faster: when compared to the "efficient" Transformers on the Long Range Arena benchmark, FNet matches the accuracy of the most accurate models, while outpacing the fastest models across all sequence lengths on GPUs (and across relatively shorter lengths on TPUs).

Paper
Code

Bilateral Multi-Perspective Matching for Natural Language Sentences

google-research-datasets/paws • 13 Feb 2017

Natural language sentence matching is a fundamental technology for a variety of tasks.

Paper
Code

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

pytorch/fairseq • • Preprint 2022

While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind.

Paper
Code

ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs

yinwenpeng/Answer_Selection • TACL 2016

(ii) We propose three attention schemes that integrate mutual influence between sentences into CNN; thus, the representation of each sentence takes into consideration its counterpart.

Paper
Code

Multi-Task Deep Neural Networks for Natural Language Understanding

namisan/mt-dnn • • ACL 2019

In this paper, we present a Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks.

Paper
Code

TinyBERT: Distilling BERT for Natural Language Understanding

huawei-noah/Pretrained-Language-Model • • Findings of the Association for Computational Linguistics 2020

To accelerate inference and reduce model size while maintaining accuracy, we first propose a novel Transformer distillation method that is specially designed for knowledge distillation (KD) of the Transformer-based models.

Paper
Code