Benchmarks

Evaluation results are listed under the subtasks. You can also submit evaluation metrics for this task.

Greatest papers with code

Toward Better Storylines with Sentence-Level Language Models

ACL 2020 google-research/google-research

We propose a sentence-level language model which selects the next sentence in a story from a finite set of fluent alternatives.

LANGUAGE MODELLING, SENTENCE EMBEDDINGS, WORD EMBEDDINGS
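
Choosing the next sentence from a finite candidate set, as the summary of this paper describes, can be sketched as below. This is a minimal illustration and not the paper's implementation: the function name `pick_next_sentence`, the toy vectors, and the choice of dot-product scoring followed by a softmax are all assumptions made for the example.

```python
import math

def pick_next_sentence(context_vec, candidate_vecs):
    """Return (best_index, probabilities) over a finite candidate set.

    Hypothetical sketch: each candidate is scored by its dot product
    with a context vector, and the scores are normalised with softmax.
    """
    scores = [sum(c * x for c, x in zip(context_vec, cand))
              for cand in candidate_vecs]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(probs)), key=probs.__getitem__)
    return best, probs

# Toy 3-dimensional embeddings (made-up numbers, for illustration only).
context = [0.2, 0.9, -0.1]
candidates = [
    [0.1, 0.8, 0.0],    # points roughly the same way as the context
    [-0.7, 0.1, 0.5],   # mostly unrelated
    [0.0, -0.9, 0.2],   # points the opposite way
]
best, probs = pick_next_sentence(context, candidates)
```

Because the candidate set is finite, the model only has to rank whole sentences rather than generate text token by token.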

Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation

EMNLP 2020 UKPLab/sentence-transformers

The training is based on the idea that a translated sentence should be mapped to the same location in the vector space as the original sentence.

KNOWLEDGE DISTILLATION, SENTENCE EMBEDDING
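
The training idea in this entry, that a translation should land at the same point in vector space as the original sentence, can be sketched as a distillation objective. The snippet below is an illustrative sketch only: it assumes a mean-squared-error loss, and the names `mse`, `distillation_loss`, and the toy vectors are hypothetical rather than taken from the sentence-transformers code.

```python
def mse(u, v):
    """Mean squared error between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v)) / len(u)

def distillation_loss(teacher_src, student_src, student_tgt):
    """Pull both student embeddings toward the teacher's source embedding.

    teacher_src: teacher embedding of the original sentence
    student_src: student embedding of the original sentence
    student_tgt: student embedding of the translated sentence
    """
    return mse(teacher_src, student_src) + mse(teacher_src, student_tgt)

# Toy 2-dimensional embeddings (made-up numbers, for illustration only).
teacher_src = [1.0, 0.0]
student_src = [1.0, 0.0]   # already matches the teacher
student_tgt = [0.0, 0.0]   # translation is still off target
loss = distillation_loss(teacher_src, student_src, student_tgt)
perfect = distillation_loss(teacher_src, teacher_src, teacher_src)
```

The loss reaches zero only when the student maps both the original sentence and its translation to the teacher's vector.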

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

IJCNLP 2019 UKPLab/sentence-transformers

However, it requires that both sentences are fed into the network, which causes a massive computational overhead: Finding the most similar pair in a collection of 10,000 sentences requires about 50 million inference computations (~65 hours) with BERT.

Ranked #6 on Semantic Textual Similarity on STS Benchmark (Spearman Correlation metric)

SEMANTIC SIMILARITY, SEMANTIC TEXTUAL SIMILARITY, SENTENCE EMBEDDINGS, TRANSFER LEARNING
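
The "about 50 million" figure in this entry follows from counting unordered sentence pairs, n(n-1)/2 with n = 10,000. The sketch below checks that arithmetic and then shows the alternative that fixed sentence embeddings make possible: encode each sentence once and compare vectors with cosine similarity. The helper names and toy vectors are hypothetical; no real encoder is called.

```python
import math
from itertools import combinations

def pair_count(n):
    """Number of unordered pairs among n sentences."""
    return n * (n - 1) // 2

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def most_similar_pair(embeddings):
    """Index pair (i, j) with the highest cosine similarity."""
    return max(
        combinations(range(len(embeddings)), 2),
        key=lambda ij: cosine(embeddings[ij[0]], embeddings[ij[1]]),
    )

# A pairwise model needs one forward pass per pair.
n_pairs = pair_count(10_000)

# With precomputed embeddings, only cheap vector comparisons remain.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]  # made-up vectors
closest = most_similar_pair(toy_embeddings)
```

The number of comparisons is still quadratic, but each one is a dot product rather than a full network inference, and the encoder runs only n times.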

WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia

10 Jul 2019 facebookresearch/LASER

We present an approach based on multilingual sentence embeddings to automatically extract parallel sentences from the content of Wikipedia articles in 85 languages, including several dialects or low-resource languages.

SENTENCE EMBEDDINGS

Margin-based Parallel Corpus Mining with Multilingual Sentence Embeddings

ACL 2019 facebookresearch/LASER

Machine translation is highly sensitive to the size and quality of the training data, which has led to an increasing interest in collecting and filtering large parallel corpora.

CROSS-LINGUAL BITEXT MINING, MACHINE TRANSLATION, PARALLEL CORPUS MINING, SENTENCE EMBEDDINGS

What you can cram into a single vector: Probing sentence embeddings for linguistic properties

3 May 2018 facebookresearch/InferSent

Although much effort has recently been devoted to training high-quality sentence embeddings, we still have a poor understanding of what they are capturing.

SENTENCE CLASSIFICATION, SENTENCE EMBEDDINGS

Universal Sentence Encoder

29 Mar 2018 facebookresearch/InferSent

For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance.

CONVERSATIONAL RESPONSE SELECTION, SEMANTIC TEXTUAL SIMILARITY, SENTENCE EMBEDDINGS, SENTIMENT ANALYSIS, SUBJECTIVITY ANALYSIS, TEXT CLASSIFICATION, TRANSFER LEARNING, WORD EMBEDDINGS

DisSent: Sentence Representation Learning from Explicit Discourse Relations

12 Oct 2017 facebookresearch/InferSent

Learning effective representations of sentences is one of the core missions of natural language understanding.

DEPENDENCY PARSING, NATURAL LANGUAGE UNDERSTANDING, SENTENCE EMBEDDINGS

Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks

15 Aug 2016 facebookresearch/InferSent

The analysis sheds light on the relative strengths of different sentence embedding methods with respect to these low level prediction tasks, and on the effect of the encoded vector's dimensionality on the resulting representations.

SENTENCE EMBEDDING