Cross-modal retrieval with noisy correspondence

14 papers with code • 3 benchmarks • 5 datasets

Noisy correspondence learning aims to eliminate the negative impact of the mismatched pairs (e.g., false positives/negatives) instead of annotation errors in several tasks.

Most implemented papers

Negative Pre-aware for Noisy Cross-modal Matching

zhangxu0963/npc 10 Dec 2023

Since clean samples are easier distinguished by GMM with increasing noise, the memory bank can still maintain high quality at a high noise ratio.

Learning with Noisy Correspondence for Cross-modal Matching

XLearning-SCU/2021-NeurIPS-NCR NeurIPS 2021

Based on this observation, we reveal and study a latent and challenging direction in cross-modal matching, named noisy correspondence, which could be regarded as a new paradigm of noisy labels.

Deep Evidential Learning with Noisy Correspondence for Cross-Modal Retrieval

qinyang79/decl ACM International Conference on Multimedia 2022

However, it will unavoidably introduce noise (i. e., mismatched pairs) into training data, dubbed noisy correspondence.

Cross-Modal Retrieval with Partially Mismatched Pairs

penghu-cs/RCL IEEE Transactions on Pattern Analysis and Machine Intelligence 2023

On the one hand, our method only utilizes the negative information which is much less likely false compared with the positive information, thus avoiding the overfitting issue to PMPs.

BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency

xu5zhao/bicro CVPR 2023

As one of the most fundamental techniques in multimodal learning, cross-modal matching aims to project various sensory modalities into a shared feature space.

Integrating Language Guidance Into Image-Text Matching for Correcting False Negatives

AAA-Zheng/LG_ITM IEEE Transactions on Multimedia 2023

Extensive experiments on two ITM benchmarks show that our method can improve the performance of existing ITM models.

Noisy Correspondence Learning with Meta Similarity Correction

hhc1997/mscn CVPR 2023

Despite the success of multimodal learning in cross-modal retrieval task, the remarkable progress relies on the correct correspondence among multimedia data.

Cross-modal Active Complementary Learning with Self-refining Correspondence

qinyang79/crcl NeurIPS 2023

Recently, image-text matching has attracted more and more attention from academia and industry, which is fundamental to understanding the latent correspondence across visual and textual modalities.

Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval

hhc1997/l2rm CVPR 2024

To achieve this, we propose L2RM, a general framework based on Optimal Transport (OT) that learns to rematch mismatched pairs.

Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining

XLearning-SCU/2024-TIP-CREAM IEEE Transactions on Image Processing 2024

Thanks to the consistency refining and mining strategy of CREAM, the overfitting on the false positives could be prevented and the consistency rooted in the false negatives could be exploited, thus leading to a robust CMR method.