TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text based Person Retrieval	CUHK-PEDES	CADA	mAP	68.87	# 2
Text based Person Retrieval	CUHK-PEDES	CADA	Rank-1	78.37	# 1
Text based Person Retrieval	CUHK-PEDES	CADA	Rank-5	91.57	# 1
Text based Person Retrieval	CUHK-PEDES	CADA	Rank-10	94.58	# 1
Text based Person Retrieval	ICFG-PEDES	CADA	Rank-1	67.81	# 1
Text based Person Retrieval	ICFG-PEDES	CADA	mAP	39.85	# 4
Text based Person Retrieval	ICFG-PEDES	CADA	Rank-5	82.34	# 1
Text based Person Retrieval	ICFG-PEDES	CADA	Rank-10	87.14	# 1
Text based Person Retrieval	RSTPReid	CADA	mAP	52.74	# 1
Text based Person Retrieval	RSTPReid	CADA	Rank-1	69.6	# 1
Text based Person Retrieval	RSTPReid	CADA	Rank-5	86.75	# 1
Text based Person Retrieval	RSTPReid	CADA	Rank-10	92.4	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cross-modal-adaptive-dual-association-for/nlp-based-person-retrival-on-cuhk-pedes)](https://paperswithcode.com/sota/nlp-based-person-retrival-on-cuhk-pedes?p=cross-modal-adaptive-dual-association-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cross-modal-adaptive-dual-association-for/text-based-person-retrieval-on-icfg-pedes)](https://paperswithcode.com/sota/text-based-person-retrieval-on-icfg-pedes?p=cross-modal-adaptive-dual-association-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cross-modal-adaptive-dual-association-for/text-based-person-retrieval-on-rstpreid-1)](https://paperswithcode.com/sota/text-based-person-retrieval-on-rstpreid-1?p=cross-modal-adaptive-dual-association-for)`

Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval

4 Dec 2023 · Dixuan Lin, Yixing Peng, Jingke Meng, Wei-Shi Zheng ·

Text-to-image person re-identification (ReID) aims to retrieve images of a person based on a given textual description. The key challenge is to learn the relations between detailed information from visual and textual modalities. Existing works focus on learning a latent space to narrow the modality gap and further build local correspondences between two modalities. However, these methods assume that image-to-text and text-to-image associations are modality-agnostic, resulting in suboptimal associations. In this work, we show the discrepancy between image-to-text association and text-to-image association and propose CADA: Cross-Modal Adaptive Dual Association that finely builds bidirectional image-text detailed associations. Our approach features a decoder-based adaptive dual association module that enables full interaction between visual and textual modalities, allowing for bidirectional and adaptive cross-modal correspondence associations. Specifically, the paper proposes a bidirectional association mechanism: Association of text Tokens to image Patches (ATP) and Association of image Regions to text Attributes (ARA). We adaptively model the ATP based on the fact that aggregating cross-modal features based on mistaken associations will lead to feature distortion. For modeling the ARA, since the attributes are typically the first distinguishing cues of a person, we propose to explore the attribute-level association by predicting the masked text phrase using the related image region. Finally, we learn the dual associations between texts and images, and the experimental results demonstrate the superiority of our dual formulation. Codes will be made publicly available.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Attribute

Cross-Modal Person Re-Identification

Person Re-Identification

Person Retrieval

Retrieval

Text based Person Retrieval

Text-based Person Retrieval

Datasets

MSMT17

CUHK-PEDES

RSTPReid ICFG-PEDES

Results from the Paper

Edit

Ranked #1 on Text based Person Retrieval on RSTPReid (mAP metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text based Person Retrieval	CUHK-PEDES	CADA	mAP	68.87	# 2	Compare
			Rank-1	78.37	# 1	Compare
			Rank-5	91.57	# 1	Compare
			Rank-10	94.58	# 1	Compare
Text based Person Retrieval	ICFG-PEDES	CADA	Rank-1	67.81	# 1	Compare
			mAP	39.85	# 4	Compare
			Rank-5	82.34	# 1	Compare
			Rank-10	87.14	# 1	Compare
Text based Person Retrieval	RSTPReid	CADA	mAP	52.74	# 1	Compare
			Rank-1	69.6	# 1	Compare
			Rank-5	86.75	# 1	Compare
			Rank-10	92.4	# 1	Compare

Methods

Add Remove

Focus

Edit Social Preview

Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove