Semantic Textual Similarity

557 papers with code • 13 benchmarks • 17 datasets

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Benchmarks

Add a Result

These leaderboards are used to track progress in Semantic Textual Similarity

Dataset	Best Model	Compare
STS Benchmark	MT-DNN-SMART	See all
MRPC	MT-DNN-SMART	See all
MTEB	ST5-XXL	See all
STS13	AnglE-LLaMA-13B	See all
SICK	PromCSE-RoBERTa-large (0.355B)	See all
STS12	PromptEOL+CSE+OPT-13B	See all
STS14	AnglE-LLaMA-13B	See all
STS15	AnglE-LLaMA-13B	See all
STS16	AnglE-LLaMA-13B	See all
SentEval	GenSen	See all
CxC	PromCSE-RoBERTa-large (0.355B)	See all
SICK-R	AnglE-LLaMA-7B	See all
MRPC Dev	Synthesizer (R+V)	See all

Show all 13 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Semantic Textual Similarity models and implementations

huggingface/transformers

9 papers

124,527

facebookresearch/xformers

3 papers

7,522

facebookresearch/InferSent

3 papers

2,279

namisan/mt-dnn

3 papers

2,198

See all 11 libraries.

Datasets

Subtasks

Latest papers with no code

Most implemented Social Latest No code

Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks

no code yet • 17 Apr 2024

This method transmits only the semantic information of an image, and the receiver reconstructs the image using an image-generation model.

Paper
Add Code

Prompt-tuning for Clickbait Detection via Text Summarization

no code yet • 17 Apr 2024

To address this problem, we propose a prompt-tuning method for clickbait detection via text summarization in this paper, text summarization is introduced to summarize the contents, and clickbait detection is performed based on the similarity between the generated summary and the contents.

Paper
Add Code

Toward a Realistic Benchmark for Out-of-Distribution Detection

no code yet • 16 Apr 2024

Deep neural networks are increasingly used in a wide range of technologies and services, but remain highly susceptible to out-of-distribution (OOD) samples, that is, drawn from a different distribution than the original training set.

Paper
Add Code

Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods

no code yet • 8 Apr 2024

In various real-world applications such as machine translation, sentiment analysis, and question answering, a pivotal role is played by NLP models, facilitating efficient communication and decision-making processes in domains ranging from healthcare to finance.

Paper
Add Code

Know When To Stop: A Study of Semantic Drift in Text Generation

no code yet • 8 Apr 2024

Overall, our methods generalize and can be applied to any long-form text generation to produce more reliable information, by balancing trade-offs between factual accuracy, information quantity and computational cost.

Paper
Add Code

Personalized Federated Learning for Spatio-Temporal Forecasting: A Dual Semantic Alignment-Based Contrastive Approach

no code yet • 4 Apr 2024

From spatial perspective, we design lightweight-but-efficient prototypes as client-level semantic representations, based on which the server evaluates spatial similarity and yields client-customized global prototypes for the supplemented inter-client contrastive task.

Paper
Add Code

ALOHa: A New Measure for Hallucination in Captioning Models

no code yet • 3 Apr 2024

Despite recent advances in multimodal pre-training for visual description, state-of-the-art models still produce captions containing errors, such as hallucinating objects not present in a scene.

Paper
Add Code

ParaICL: Towards Robust Parallel In-Context Learning

no code yet • 31 Mar 2024

However, our preliminary experiments indicate that the effectiveness of ICL is limited by the length of the input context.

Paper
Add Code

Attention-aware semantic relevance predicting Chinese sentence reading

no code yet • 27 Mar 2024

Our approach underscores the potential of these metrics to advance our comprehension of how humans understand and process language, ultimately leading to a better understanding of language comprehension and processing.

Paper
Add Code

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

no code yet • 27 Mar 2024

Most of the existing works focus on improving the representation ability for the contextualized embedding of the [CLS] token and calculate relevance using textual semantic similarity.

Paper
Add Code

Semantic Textual Similarity

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result