Sentence Similarity
66 papers with code • 1 benchmarks • 1 datasets
Latest papers with no code
From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization
To address this gap our paper introduces HunSum-2 an open-source Hungarian corpus suitable for training abstractive and extractive summarization models.
Self-Critical Alternate Learning based Semantic Broadcast Communication
In particular, to enable stable optimization via a nondifferentiable semantic metric, we regard sentence similarity as a reward and formulate this learning process as an RL problem.
A Distribution-Based Threshold for Determining Sentence Similarity
The goal of these distributions is to find a discriminating factor, that we call "threshold", which represents a well-defined quantity that can be used to distinguish vector distances of similar pairs from vector distances of dissimilar pairs in new predictions and later analyses.
Validating ChatGPT Facts through RDF Knowledge Graphs and Sentence Similarity
Since ChatGPT offers detailed responses without justifications, and erroneous facts even for popular persons, events and places, in this paper we present a novel pipeline that retrieves the response of ChatGPT in RDF and tries to validate the ChatGPT facts using one or more RDF Knowledge Graphs (KGs).
Deep Learning-Empowered Semantic Communication Systems with a Shared Knowledge Base
With the aid of the shared knowledge base, the proposed system integrates the message and corresponding knowledge from the shared knowledge base to obtain the residual information, which enables the system to transmit fewer symbols without semantic performance degradation.
An Introduction to Natural Language Processing Techniques and Framework for Clinical Implementation in Radiation Oncology
Natural Language Processing (NLP) is a key technique for developing Medical Artificial Intelligence (AI) systems that leverage Electronic Health Record (EHR) data to build diagnostic and prognostic models.
Exploring the Boundaries of GPT-4 in Radiology
In this paper, we focus on assessing the performance of GPT-4, the most capable LLM so far, on the text-based applications for radiology reports, comparing against state-of-the-art (SOTA) radiology-specific models.
A Comparative Study of Sentence Embedding Models for Assessing Semantic Variation
In this paper, we compare several recent sentence embedding methods via time-series of semantic similarity between successive sentences and matrices of pairwise sentence similarity for multiple books of literature.
Semantic Equivalence of e-Commerce Queries
Behavioral similarity leverages historical search behavior to generate vector representations of query intent.
LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM
Third, we apply a Siamese architecture on BLOOM model with a contrastive objective to ease the multi-lingual labeled data scarcity.