Semantic Textual Similarity

557 papers with code • 13 benchmarks • 17 datasets

Semantic textual similarity deals with determining how similar two pieces of texts are. This can take the form of assigning a score from 1 to 5. Related tasks are paraphrase or duplicate identification.

Image source: Learning Semantic Textual Similarity from Conversations

Libraries

Use these libraries to find Semantic Textual Similarity models and implementations

Latest papers with no code

Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks

no code yet • 17 Apr 2024

This method transmits only the semantic information of an image, and the receiver reconstructs the image using an image-generation model.

Prompt-tuning for Clickbait Detection via Text Summarization

no code yet • 17 Apr 2024

To address this problem, we propose a prompt-tuning method for clickbait detection via text summarization in this paper, text summarization is introduced to summarize the contents, and clickbait detection is performed based on the similarity between the generated summary and the contents.

Toward a Realistic Benchmark for Out-of-Distribution Detection

no code yet • 16 Apr 2024

Deep neural networks are increasingly used in a wide range of technologies and services, but remain highly susceptible to out-of-distribution (OOD) samples, that is, drawn from a different distribution than the original training set.

Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods

no code yet • 8 Apr 2024

In various real-world applications such as machine translation, sentiment analysis, and question answering, a pivotal role is played by NLP models, facilitating efficient communication and decision-making processes in domains ranging from healthcare to finance.

Know When To Stop: A Study of Semantic Drift in Text Generation

no code yet • 8 Apr 2024

Overall, our methods generalize and can be applied to any long-form text generation to produce more reliable information, by balancing trade-offs between factual accuracy, information quantity and computational cost.

Personalized Federated Learning for Spatio-Temporal Forecasting: A Dual Semantic Alignment-Based Contrastive Approach

no code yet • 4 Apr 2024

From spatial perspective, we design lightweight-but-efficient prototypes as client-level semantic representations, based on which the server evaluates spatial similarity and yields client-customized global prototypes for the supplemented inter-client contrastive task.

ALOHa: A New Measure for Hallucination in Captioning Models

no code yet • 3 Apr 2024

Despite recent advances in multimodal pre-training for visual description, state-of-the-art models still produce captions containing errors, such as hallucinating objects not present in a scene.

ParaICL: Towards Robust Parallel In-Context Learning

no code yet • 31 Mar 2024

However, our preliminary experiments indicate that the effectiveness of ICL is limited by the length of the input context.

Attention-aware semantic relevance predicting Chinese sentence reading

no code yet • 27 Mar 2024

Our approach underscores the potential of these metrics to advance our comprehension of how humans understand and process language, ultimately leading to a better understanding of language comprehension and processing.

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

no code yet • 27 Mar 2024

Most of the existing works focus on improving the representation ability for the contextualized embedding of the [CLS] token and calculate relevance using textual semantic similarity.