Text Matching

129 papers with code • 0 benchmarks • 7 datasets

Matching a target text to a source text based on their meaning.

Latest papers with no code

MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets

no code yet • 5 Mar 2024

Development of multimodal interactive systems is hindered by the lack of rich, multimodal (text, images) conversational data, which is needed in large quantities for LLMs.

Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification

no code yet • 5 Mar 2024

Human interventions to the rules for the TACRED relation \texttt{org:parents} boost the performance on that relation by as much as 26\% relative improvement, without negatively impacting the other relations, and without retraining the semantic matching component.

Image-Text Matching with Multi-View Attention

no code yet • 27 Feb 2024

Existing two-stream models for image-text matching show good performance while ensuring retrieval speed and have received extensive attention from industry and academia.

Multi-Intent Attribute-Aware Text Matching in Searching

no code yet • 12 Feb 2024

Since attributes from two ends are often not aligned in terms of number and type, we propose to exploit the benefit of attributes by multiple-intent modeling.

GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition

no code yet • 18 Jan 2024

Vision-Language Models (VLMs), pre-trained on large-scale datasets, have shown impressive performance in various visual recognition tasks.

Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech

no code yet • 12 Jan 2024

Furthermore, experiments on the continuous speech dataset LibriSpeech demonstrate that, by incorporating audio discrimination, CLAD achieves significant performance gain over CL without audio discrimination.

CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs

no code yet • 5 Jan 2024

When exploring the development of Artificial General Intelligence (AGI), a critical task for these models involves interpreting and processing information from multiple image inputs.

Backdoor Attack on Unpaired Medical Image-Text Foundation Models: A Pilot Study on MedCLIP

no code yet • 1 Jan 2024

Notably, MedCLIP, a vision-language contrastive learning-based medical FM, has been designed using unpaired image-text training.

OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization

no code yet • 7 Dec 2023

Investigating the generation of high-transferability adversarial examples is crucial for uncovering VLP models' vulnerabilities in practical scenarios.

Plug-and-Play, Dense-Label-Free Extraction of Open-Vocabulary Semantic Segmentation from Vision-Language Models

no code yet • 28 Nov 2023

To alleviate this issue, we introduce Salience Dropout; by iteratively dropping patches that the model is most attentive to, we are able to better resolve the entire extent of the segmentation mask.