Paraphrase Identification

72 papers with code • 10 benchmarks • 17 datasets

The goal of Paraphrase Identification is to determine whether a pair of sentences have the same meaning.

Source: Adversarial Examples with Difficult Common Words for Paraphrase Identification

Image source: On Paraphrase Identification Corpora

Libraries

Use these libraries to find Paraphrase Identification models and implementations

Factorising Meaning and Form for Intent-Preserving Paraphrasing

tomhosking/separator ACL 2021

We propose a method for generating paraphrases of English questions that retain the original intent but use a different surface form.

27
31 May 2021

FNet: Mixing Tokens with Fourier Transforms

labmlai/annotated_deep_learning_paper_implementations NAACL 2022

At longer input lengths, our FNet model is significantly faster: when compared to the "efficient" Transformers on the Long Range Arena benchmark, FNet matches the accuracy of the most accurate models, while outpacing the fastest models across all sequence lengths on GPUs (and across relatively shorter lengths on TPUs).

48,540
09 May 2021

Entailment as Few-Shot Learner

PaddlePaddle/PaddleNLP 29 Apr 2021

Large pre-trained language models (LMs) have demonstrated remarkable ability as few-shot learners.

11,453
29 Apr 2021

TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning

UKPLab/sentence-transformers 14 Apr 2021

Learning sentence embeddings often requires a large amount of labeled data.

13,848
14 Apr 2021

Explaining Neural Network Predictions on Sentence Pairs via Learning Word-Group Masks

UVa-NLP/GMASK NAACL 2021

Most existing methods generate post-hoc explanations for neural network models by identifying individual feature attributions or detecting interactions between adjacent features.

4
09 Apr 2021

Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning

rabeehk/compacter ACL 2021

Although pretrained language models can be fine-tuned to produce state-of-the-art results for a very wide range of language understanding tasks, the dynamics of this process are not well understood, especially in the low data regime.

122
22 Dec 2020

Self-Explaining Structures Improve NLP Models

ShannonAI/Self_Explaining_Structures_Improve_NLP_Models 3 Dec 2020

The proposed model comes with the following merits: (1) span weights make the model self-explainable and do not require an additional probing model for interpretation; (2) the proposed model is general and can be adapted to any existing deep learning structures in NLP; (3) the weight associated with each text span provides direct importance scores for higher-level text units such as phrases and sentences.

80
03 Dec 2020

Adversarial Semantic Collisions

csong27/collision-bert EMNLP 2020

We study semantic collisions: texts that are semantically unrelated but judged as similar by NLP models.

25
09 Nov 2020

PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge

heyunh2015/PARADE_dataset EMNLP 2020

We present a new benchmark dataset called PARADE for paraphrase identification that requires specialized domain knowledge.

9
08 Oct 2020