TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	ClonedPerson->mAP	24.4	# 1
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	ClonedPerson->Rank-1	25.4	# 1
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	RandPerson->mAP	16.0	# 2
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	RandPerson->Rank-1	17.1	# 2
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	MSMT17->mAP	22.5	# 1
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	MSMT17->Rank-1	23.7	# 1
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	MSMT17-All->mAP	30.7	# 1
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	MSMT17-All->Rank-1	31.9	# 1
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	Market-1501->mAP	21.4	# 1
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	Market-1501->Rank-1	22.2	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	ClonedPerson->mAP	62.3	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	ClonedPerson->Rank-1	84.8	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	RandPerson->mAP	49.1	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	RandPerson->Rank-1	77.3	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	MSMT17->mAP	52.0	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	MSMT17->Rank-1	80.1	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	MSMT17-All->mAP	58.4	# 1
Generalizable Person Re-identification	Market-1501	TransMatcher	MSMT17-All->Rank-1	82.6	# 1
Generalizable Person Re-identification	MSMT17	TransMatcher	RandPerson->mAP	17.7	# 1
Generalizable Person Re-identification	MSMT17	TransMatcher	RandPerson->Rank-1	48.3	# 1
Generalizable Person Re-identification	MSMT17	TransMatcher	Market-1501->Rank1	47.3	# 1
Generalizable Person Re-identification	MSMT17	TransMatcher	Market-1501->mAP	18.4	# 1
Generalizable Person Re-identification	MSMT17	TransMatcher	ClonedPerson->mAP	20.8	# 1
Generalizable Person Re-identification	MSMT17	TransMatcher	ClonedPerson->Rank-1	51.6	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/transformer-based-deep-image-matching-for/generalizable-person-re-identification-on-22)](https://paperswithcode.com/sota/generalizable-person-re-identification-on-22?p=transformer-based-deep-image-matching-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/transformer-based-deep-image-matching-for/generalizable-person-re-identification-on-21)](https://paperswithcode.com/sota/generalizable-person-re-identification-on-21?p=transformer-based-deep-image-matching-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/transformer-based-deep-image-matching-for/generalizable-person-re-identification-on-20)](https://paperswithcode.com/sota/generalizable-person-re-identification-on-20?p=transformer-based-deep-image-matching-for)`

TransMatcher: Deep Image Matching Through Transformers for Generalizable Person Re-identification

NeurIPS 2021 · Shengcai Liao, Ling Shao ·

Transformers have recently gained increasing attention in computer vision. However, existing studies mostly use Transformers for feature representation learning, e.g. for image classification and dense predictions, and the generalizability of Transformers is unknown. In this work, we further investigate the possibility of applying Transformers for image matching and metric learning given pairs of images. We find that the Vision Transformer (ViT) and the vanilla Transformer with decoders are not adequate for image matching due to their lack of image-to-image attention. Thus, we further design two naive solutions, i.e. query-gallery concatenation in ViT, and query-gallery cross-attention in the vanilla Transformer. The latter improves the performance, but it is still limited. This implies that the attention mechanism in Transformers is primarily designed for global feature aggregation, which is not naturally suitable for image matching. Accordingly, we propose a new simplified decoder, which drops the full attention implementation with the softmax weighting, keeping only the query-key similarity computation. Additionally, global max pooling and a multilayer perceptron (MLP) head are applied to decode the matching result. This way, the simplified decoder is computationally more efficient, while at the same time more effective for image matching. The proposed method, called TransMatcher, achieves state-of-the-art performance in generalizable person re-identification, with up to 6.1% and 5.7% performance gains in Rank-1 and mAP, respectively, on several popular datasets. Code is available at https://github.com/ShengcaiLiao/QAConv.

PDF Abstract NeurIPS 2021 PDF NeurIPS 2021 Abstract

Code

Add Remove Mark official

shengcailiao/QAConv official

195

ShengcaiLiao/TransMatcher official

Tasks

Add Remove

Generalizable Person Re-identification

Image Classification

Metric Learning

Person Re-Identification

Representation Learning

Datasets

Market-1501

CUHK03

DukeMTMC-reID MSMT17

VIPeR

Results from the Paper

Edit

Ranked #1 on Generalizable Person Re-identification on Market-1501 (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Generalizable Person Re-identification	CUHK03-NP (detected)	TransMatcher	ClonedPerson->mAP	24.4	# 1	Compare
			ClonedPerson->Rank-1	25.4	# 1	Compare
			RandPerson->mAP	16.0	# 2	Compare
			RandPerson->Rank-1	17.1	# 2	Compare
			MSMT17->mAP	22.5	# 1	Compare
			MSMT17->Rank-1	23.7	# 1	Compare
			MSMT17-All->mAP	30.7	# 1	Compare
			MSMT17-All->Rank-1	31.9	# 1	Compare
			Market-1501->mAP	21.4	# 1	Compare
			Market-1501->Rank-1	22.2	# 1	Compare
Generalizable Person Re-identification	Market-1501	TransMatcher	ClonedPerson->mAP	62.3	# 1	Compare
			ClonedPerson->Rank-1	84.8	# 1	Compare
			RandPerson->mAP	49.1	# 1	Compare
			RandPerson->Rank-1	77.3	# 1	Compare
			MSMT17->mAP	52.0	# 1	Compare
			MSMT17->Rank-1	80.1	# 1	Compare
			MSMT17-All->mAP	58.4	# 1	Compare
			MSMT17-All->Rank-1	82.6	# 1	Compare
Generalizable Person Re-identification	MSMT17	TransMatcher	RandPerson->mAP	17.7	# 1	Compare
			RandPerson->Rank-1	48.3	# 1	Compare
			Market-1501->Rank1	47.3	# 1	Compare
			Market-1501->mAP	18.4	# 1	Compare
			ClonedPerson->mAP	20.8	# 1	Compare
			ClonedPerson->Rank-1	51.6	# 1	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Max Pooling • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer • Vision Transformer

Edit Social Preview

TransMatcher: Deep Image Matching Through Transformers for Generalizable Person Re-identification

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove