2 dataset results for Semantic Textual Similarity within Bi-Encoder AND Texts AND English

GLUE (General Language Understanding Evaluation benchmark)

General Language Understanding Evaluation (GLUE) benchmark is a collection of nine natural language understanding tasks, including single-sentence tasks CoLA and SST-2, similarity and paraphrasing tasks MRPC, STS-B and QQP, and natural language inference tasks MNLI, QNLI, RTE and WNLI.

2,729 PAPERS • 25 BENCHMARKS

MRPC (Microsoft Research Paraphrase Corpus)

Microsoft Research Paraphrase Corpus (MRPC) is a corpus consists of 5,801 sentence pairs collected from newswire articles. Each pair is labelled if it is a paraphrase or not by human annotators. The whole set is divided into a training subset (4,076 sentence pairs of which 2,753 are paraphrases) and a test subset (1,725 pairs of which 1,147 are paraphrases).

699 PAPERS • 5 BENCHMARKS

Datasets

2 dataset results for Semantic Textual Similarity within Bi-Encoder AND Texts AND English