TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Metric Learning	CARS196	ResNet50 + Language	R@1	90.2	# 8
Metric Learning	CUB-200-2011	ResNet50 + Language	R@1	71.4	# 8
Metric Learning	Stanford Online Products	ResNet50 + Language	R@1	81.3	# 15

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/integrating-language-guidance-into-vision/metric-learning-on-cars196)](https://paperswithcode.com/sota/metric-learning-on-cars196?p=integrating-language-guidance-into-vision)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/integrating-language-guidance-into-vision/metric-learning-on-cub-200-2011)](https://paperswithcode.com/sota/metric-learning-on-cub-200-2011?p=integrating-language-guidance-into-vision)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/integrating-language-guidance-into-vision/metric-learning-on-stanford-online-products-1)](https://paperswithcode.com/sota/metric-learning-on-stanford-online-products-1?p=integrating-language-guidance-into-vision)`

Integrating Language Guidance into Vision-based Deep Metric Learning

CVPR 2022 · Karsten Roth, Oriol Vinyals, Zeynep Akata ·

Deep Metric Learning (DML) proposes to learn metric spaces which encode semantic similarities as embedding space distances. These spaces should be transferable to classes beyond those seen during training. Commonly, DML methods task networks to solve contrastive ranking tasks defined over binary class assignments. However, such approaches ignore higher-level semantic relations between the actual classes. This causes learned embedding spaces to encode incomplete semantic context and misrepresent the semantic relation between classes, impacting the generalizability of the learned metric space. To tackle this issue, we propose a language guidance objective for visual similarity learning. Leveraging language embeddings of expert- and pseudo-classnames, we contextualize and realign visual representation spaces corresponding to meaningful language semantics for better semantic consistency. Extensive experiments and ablations provide a strong motivation for our proposed approach and show language guidance offering significant, model-agnostic improvements for DML, achieving competitive and state-of-the-art results on all benchmarks. Code available at https://github.com/ExplainableML/LanguageGuidance_for_DML.

PDF Abstract CVPR 2022 PDF CVPR 2022 Abstract

Code

Add Remove Mark official

explainableml/languageguidance_for_… official

Tasks

Add Remove

Metric Learning

Datasets

CUB-200-2011

Stanford Online Products CARS196

Results from the Paper

Edit

Ranked #8 on Metric Learning on CARS196 (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Metric Learning	CARS196	ResNet50 + Language	R@1	90.2	# 8	Compare
Metric Learning	CUB-200-2011	ResNet50 + Language	R@1	71.4	# 8	Compare
Metric Learning	Stanford Online Products	ResNet50 + Language	R@1	81.3	# 15	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Integrating Language Guidance into Vision-based Deep Metric Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove