A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses

Recently, substantial research efforts in Deep Metric Learning (DML) have focused on designing complex pairwise-distance losses, which require convoluted schemes, such as sample mining or pair weighting, to ease optimization. The standard cross-entropy loss for classification has been largely overlooked in DML. On the surface, the cross-entropy may seem unrelated and irrelevant to metric learning as it does not explicitly involve pairwise distances. However, we provide a theoretical analysis that links the cross-entropy to several well-known and recent pairwise losses. Our connections are drawn from two different perspectives: one based on an explicit optimization insight; the other on discriminative and generative views of the mutual information between the labels and the learned features. First, we explicitly demonstrate that the cross-entropy is an upper bound on a new pairwise loss, which has a structure similar to various pairwise losses: it minimizes intra-class distances while maximizing inter-class distances. As a result, minimizing the cross-entropy can be seen as an approximate bound-optimization (or Majorize-Minimize) algorithm for minimizing this pairwise loss. Second, we show that, more generally, minimizing the cross-entropy is actually equivalent to maximizing the mutual information, to which we connect several well-known pairwise losses. Furthermore, we show that various standard pairwise losses can be explicitly related to one another via bound relationships. Our findings indicate that the cross-entropy represents a proxy for maximizing the mutual information -- as pairwise losses do -- without the need for convoluted sample-mining heuristics. Our experiments over four standard DML benchmarks strongly support our findings. We obtain state-of-the-art results, outperforming recent and complex DML methods.
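To make the mutual-information view concrete, here is a minimal sketch in notation introduced for illustration (not taken verbatim from the paper): Z denotes the learned embeddings, Y the class labels, and L_CE the softmax cross-entropy loss.

\[
\mathcal{I}(Z;Y)
\;=\; \underbrace{\mathcal{H}(Y) - \mathcal{H}(Y \mid Z)}_{\text{discriminative view}}
\;=\; \underbrace{\mathcal{H}(Z) - \mathcal{H}(Z \mid Y)}_{\text{generative view}},
\qquad
\mathcal{L}_{\mathrm{CE}} \;\ge\; \mathcal{H}(Y \mid Z).
\]

Since H(Y) is fixed by the label distribution, lowering L_CE tightens an upper bound on the conditional entropy H(Y | Z) and therefore raises a lower bound on I(Z;Y). Read through the generative decomposition, pairwise losses pursue the same objective: shrinking H(Z | Y) corresponds to small intra-class distances, while keeping H(Z) large corresponds to well-spread features and large inter-class distances.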

ECCV 2020

Results from the Paper


Ranked #12 on Metric Learning on CARS196 (using extra training data)

Task             Dataset                     Model                       Metric  Value  Global Rank
Metric Learning  CARS196                     ResNet-50 + Cross-Entropy   R@1     89.3   #12
Metric Learning  CUB-200-2011                ResNet-50 + Cross-Entropy   R@1     69.2   #13
Metric Learning  In-Shop                     ResNet-50 + Cross-Entropy   R@1     90.6   #13
Metric Learning  Stanford Online Products    ResNet-50 + Cross-Entropy   R@1     81.1   #18
