Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval

Human sketches are unique in capturing both the spatial topology of a visual object and its subtle appearance details. Fine-grained sketch-based image retrieval (FG-SBIR) leverages these fine-grained characteristics of sketches to conduct instance-level retrieval of photos. However, human sketches are often highly abstract and iconic, resulting in severe misalignments with candidate photos that make matching of subtle visual details difficult. Existing FG-SBIR approaches focus only on coarse holistic matching via deep cross-domain representation learning, but do not explicitly account for fine-grained details and their spatial context. In this paper, a novel deep FG-SBIR model is proposed which differs significantly from existing models in that: (1) it is spatially aware, achieved by introducing an attention module that is sensitive to the spatial position of visual details; (2) it combines coarse and fine semantic information via a shortcut-connection fusion block; and (3) it models feature correlation and is robust to misalignments between the features extracted across the two domains by introducing a novel loss based on a higher-order learnable energy function (HOLEF). Extensive experiments show that the proposed deep spatial-semantic attention model significantly outperforms the state of the art.
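To make the abstract's three components concrete, below is a minimal PyTorch sketch, not the authors' implementation: the module names (`SpatialAttention`, `HOLEFLoss`), the 1x1-conv scoring, the residual fusion, and the triplet margin are all assumptions chosen to match the description of (1) a spatially aware attention module, (2) a shortcut-connection fusion of coarse and fine features, and (3) a higher-order learnable energy function used as the distance in a triplet loss.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SpatialAttention(nn.Module):
    """Hypothetical spatial attention with shortcut-connection fusion.

    A 1x1 convolution scores each spatial location of a conv feature
    map, softmax normalises the scores over all H*W positions, and the
    attended map is fused with the unattended one via a residual
    shortcut, mixing coarse (holistic) and fine (attended) semantics.
    """

    def __init__(self, channels: int):
        super().__init__()
        self.score = nn.Conv2d(channels, 1, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        attn = self.score(x).view(b, 1, h * w)       # per-location scores
        attn = F.softmax(attn, dim=-1).view(b, 1, h, w)
        return x + x * attn                          # shortcut fusion


class HOLEFLoss(nn.Module):
    """Hypothetical triplet loss with a higher-order learnable energy.

    Instead of a plain Euclidean distance, the energy between two
    embeddings is a learnable weighted sum over the outer product of
    their element-wise differences, so the loss can model correlations
    between feature dimensions and tolerate sketch-photo misalignment.
    """

    def __init__(self, dim: int, margin: float = 0.3):
        super().__init__()
        # Initialised to the identity, so training starts from squared L2.
        self.W = nn.Parameter(torch.eye(dim))
        self.margin = margin

    def energy(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        d = a - b                                    # (batch, dim)
        # d^T W d: a weighted sum over the outer product d d^T.
        return torch.einsum('bi,ij,bj->b', d, self.W, d)

    def forward(self, sketch, photo_pos, photo_neg):
        pos = self.energy(sketch, photo_pos)
        neg = self.energy(sketch, photo_neg)
        return F.relu(self.margin + pos - neg).mean()
```

With `W` initialised to the identity, the energy reduces to squared Euclidean distance, so under these assumptions the learnable higher-order term starts from the standard triplet-loss metric and refines it during training.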

Results from Other Papers


| Task | Dataset | Model | Metric | Value | Rank |
|---|---|---|---|---|---|
| Sketch-Based Image Retrieval | Chairs | Chairs net + CFF + HOLEF | R@1 | 81.4 | # 2 |
| Sketch-Based Image Retrieval | Chairs | Chairs net + CFF + HOLEF | R@10 | 95.9 | # 3 |
| Sketch-Based Image Retrieval | Handbags | Handbags net | R@1 | 39.9 | # 3 |
| Sketch-Based Image Retrieval | Handbags | Handbags net | R@10 | 82.1 | # 3 |
| Sketch-Based Image Retrieval | Handbags | Handbags net + CFF + HOLEF | R@1 | 49.4 | # 2 |
| Sketch-Based Image Retrieval | Handbags | Handbags net + CFF + HOLEF | R@10 | 82.7 | # 2 |
