TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Unsupervised Person Re-Identification	DukeMTMC-reID	VAL-PAT	Rank-1	86.1	# 2
Unsupervised Person Re-Identification	DukeMTMC-reID	VAL-PAT	MAP	74.9	# 2
Unsupervised Person Re-Identification	MSMT17	VAL-PAT	mAP	38.9	# 8
Unsupervised Person Re-Identification	MSMT17	VAL-PAT	Rank-1	67.5	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-transferable-pedestrian/unsupervised-person-re-identification-on-5)](https://paperswithcode.com/sota/unsupervised-person-re-identification-on-5?p=learning-transferable-pedestrian)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-transferable-pedestrian/unsupervised-person-re-identification-on-12)](https://paperswithcode.com/sota/unsupervised-person-re-identification-on-12?p=learning-transferable-pedestrian)`

Learning Transferable Pedestrian Representation from Multimodal Information Supervision

12 Apr 2023 · Liping Bao, Longhui Wei, Xiaoyu Qiu, Wengang Zhou, Houqiang Li, Qi Tian ·

Recent researches on unsupervised person re-identification~(reID) have demonstrated that pre-training on unlabeled person images achieves superior performance on downstream reID tasks than pre-training on ImageNet. However, those pre-trained methods are specifically designed for reID and suffer flexible adaption to other pedestrian analysis tasks. In this paper, we propose VAL-PAT, a novel framework that learns transferable representations to enhance various pedestrian analysis tasks with multimodal information. To train our framework, we introduce three learning objectives, \emph{i.e.,} self-supervised contrastive learning, image-text contrastive learning and multi-attribute classification. The self-supervised contrastive learning facilitates the learning of the intrinsic pedestrian properties, while the image-text contrastive learning guides the model to focus on the appearance information of pedestrians.Meanwhile, multi-attribute classification encourages the model to recognize attributes to excavate fine-grained pedestrian information. We first perform pre-training on LUPerson-TA dataset, where each image contains text and attribute annotations, and then transfer the learned representations to various downstream tasks, including person reID, person attribute recognition and text-based person search. Extensive experiments demonstrate that our framework facilitates the learning of general pedestrian representations and thus leads to promising results on various pedestrian analysis tasks.

PDF Abstract

Code

Add Remove Mark official

baolp/VAL-PAT official

Tasks

Add Remove

Attribute

Contrastive Learning

Person Re-Identification

Person Search

Text based Person Search

Unsupervised Person Re-Identification

Datasets

DukeMTMC-reID MSMT17

CUHK-PEDES

Occluded REID

PA-100K

RAP

Results from the Paper

Edit

Ranked #2 on Unsupervised Person Re-Identification on DukeMTMC-reID

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Unsupervised Person Re-Identification	DukeMTMC-reID	VAL-PAT	Rank-1	86.1	# 2	Compare
Unsupervised Person Re-Identification	DukeMTMC-reID	VAL-PAT	MAP	74.9	# 2	Compare
Unsupervised Person Re-Identification	MSMT17	VAL-PAT	mAP	38.9	# 8	Compare
Unsupervised Person Re-Identification	MSMT17	VAL-PAT	Rank-1	67.5	# 7	Compare

Methods

Add Remove

Contrastive Learning

Edit Social Preview

Learning Transferable Pedestrian Representation from Multimodal Information Supervision

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove