TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Metric Learning	CARS196	ProxyAnchor + DIML	R@1	87.01	# 22
Metric Learning	CUB-200-2011	MS + DIML	R@1	68.15	# 16
Metric Learning	Stanford Online Products	Margin + DIML	R@1	79.26	# 27

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/towards-interpretable-deep-metric-learning/metric-learning-on-cub-200-2011)](https://paperswithcode.com/sota/metric-learning-on-cub-200-2011?p=towards-interpretable-deep-metric-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/towards-interpretable-deep-metric-learning/metric-learning-on-cars196)](https://paperswithcode.com/sota/metric-learning-on-cars196?p=towards-interpretable-deep-metric-learning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/towards-interpretable-deep-metric-learning/metric-learning-on-stanford-online-products-1)](https://paperswithcode.com/sota/metric-learning-on-stanford-online-products-1?p=towards-interpretable-deep-metric-learning)`

Towards Interpretable Deep Metric Learning with Structural Matching

ICCV 2021 · Wenliang Zhao, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie zhou ·

How do the neural networks distinguish two images? It is of critical importance to understand the matching mechanism of deep models for developing reliable intelligent systems for many risky visual applications such as surveillance and access control. However, most existing deep metric learning methods match the images by comparing feature vectors, which ignores the spatial structure of images and thus lacks interpretability. In this paper, we present a deep interpretable metric learning (DIML) method for more transparent embedding learning. Unlike conventional metric learning methods based on feature vector comparison, we propose a structural matching strategy that explicitly aligns the spatial embeddings by computing an optimal matching flow between feature maps of the two images. Our method enables deep models to learn metrics in a more human-friendly way, where the similarity of two images can be decomposed to several part-wise similarities and their contributions to the overall similarity. Our method is model-agnostic, which can be applied to off-the-shelf backbone networks and metric learning methods. We evaluate our method on three major benchmarks of deep metric learning including CUB200-2011, Cars196, and Stanford Online Products, and achieve substantial improvements over popular metric learning methods with better interpretability. Code is available at https://github.com/wl-zhao/DIML

PDF Abstract ICCV 2021 PDF ICCV 2021 Abstract

Code

Add Remove Mark official

wl-zhao/diml official

Tasks

Add Remove

Metric Learning

Datasets

CUB-200-2011

Stanford Cars

Stanford Online Products CARS196

Results from the Paper

Add Remove

Ranked #16 on Metric Learning on CUB-200-2011

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Metric Learning	CARS196	ProxyAnchor + DIML	R@1	87.01	# 22	Compare
Metric Learning	CUB-200-2011	MS + DIML	R@1	68.15	# 16	Compare
Metric Learning	Stanford Online Products	Margin + DIML	R@1	79.26	# 27	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Towards Interpretable Deep Metric Learning with Structural Matching

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove