SimCSE

Introduced by Gao et al. in SimCSE: Simple Contrastive Learning of Sentence Embeddings

SimCSE is a contrastive learning framework for generating sentence embeddings. It utilizes an unsupervised approach, which takes an input sentence and predicts itself in contrastive objective, with only standard dropout used as noise. The authors find that dropout acts as minimal “data augmentation” of hidden representations, while removing it leads to a representation collapse. Afterwards a supervised approach is used, which incorporates annotated pairs from natural language inference datasets into the contrastive framework, by using “entailment” pairs as positives and “contradiction

Source: SimCSE: Simple Contrastive Learning of Sentence Embeddings

Read Paper See Code

Papers

Paper	Code	Results	Date	Stars

Tasks

Task	Papers	Share
Sentence	36	26.09%
Sentence Embeddings	28	20.29%
Semantic Textual Similarity	18	13.04%
Sentence Embedding	13	9.42%
Language Modelling	7	5.07%
Retrieval	6	4.35%
Semantic Similarity	4	2.90%
Question Answering	2	1.45%
Information Retrieval	2	1.45%

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
🤖 No Components Found	You can add them if they exist; e.g. Mask R-CNN uses RoIAlign

Categories

Add Remove

Sentence Embeddings