PARADE contains paraphrases that overlap very little at the lexical and syntactic level but are semantically equivalent based on computer science domain knowledge, as well as non-paraphrases that overlap greatly at the lexical and syntactic level but are not semantically equivalent based on this domain knowledge.
Source: PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain KnowledgePaper | Code | Results | Date | Stars |
---|