Alignment Enhancement Network for Fine-grained Visual Categorization

1 Mar 2021  ·  Yutao Hu ·

Fine-grained visual categorization (FGVC) aims to automatically recognize objects from different sub-ordinate categories. Despite attracting considerable attention from both academia and industry, it remains a challenging task due to the subtle visual differences among classes. Cross-layer feature aggregation and cross-image pairwise learning have become prevalent approaches for improving FGVC performance by extracting discriminative, class-specific features. However, simple aggregation strategies fail to fully exploit cross-layer information, while existing pairwise learning methods fail to explore long-range interactions between different images. To address these problems, we propose a novel Alignment Enhancement Network (AENet) with two levels of alignment: Cross-layer Alignment (CLA) and Cross-image Alignment (CIA). The CLA module exploits the cross-layer relationship between low-level spatial information and high-level semantic information, which contributes to cross-layer feature aggregation and improves the capacity of the feature representation for input images. The new CIA module is further introduced to produce an aligned feature map, which enhances relevant information and suppresses irrelevant information across the whole spatial region. Our method is based on the underlying assumption that the aligned feature map should be closer to the inputs of CIA when they belong to the same category. Accordingly, we establish a Semantic Affinity Loss to supervise the feature alignment within each CIA block. Experimental results on four challenging datasets show that the proposed AENet achieves state-of-the-art results, outperforming prior arts.
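The abstract does not give the exact formulation of CIA or the Semantic Affinity Loss, but the described mechanism (aligning one image's feature map to another via spatial affinities, then supervising the aligned map to stay close to the input for same-class pairs) can be sketched roughly as follows. This is a minimal illustrative sketch under assumed details (dot-product affinity, softmax weighting, MSE-based loss); the function names and formulation are hypothetical, not the paper's actual implementation.

```python
import numpy as np

def cross_image_align(f_a, f_b):
    """Hypothetical sketch of Cross-Image Alignment (CIA).

    f_a, f_b: flattened feature maps of shape (HW, C) from two images.
    Returns f_b's features re-assembled in f_a's spatial layout via a
    softmax-normalized affinity matrix, enhancing spatially matching
    (relevant) positions and suppressing non-matching (irrelevant) ones.
    """
    # Affinity between every spatial position of A and every position of B.
    affinity = f_a @ f_b.T                                       # (HW, HW)
    # Row-wise softmax: each position of A attends over B's positions.
    w = np.exp(affinity - affinity.max(axis=1, keepdims=True))
    w /= w.sum(axis=1, keepdims=True)
    return w @ f_b                                               # (HW, C)

def semantic_affinity_loss(aligned, f_a, same_class, margin=1.0):
    """Assumed contrastive-style loss matching the abstract's intuition:
    pull the aligned map toward f_a for same-category pairs, push it
    beyond a margin otherwise. The margin form is an assumption."""
    dist = np.mean((aligned - f_a) ** 2)
    return dist if same_class else max(0.0, margin - dist)
```

A quick usage example with random features: `cross_image_align(f_a, f_b)` keeps the `(HW, C)` shape of the inputs, and the loss is non-negative for both same-class and different-class pairs.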

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Fine-Grained Image Classification | CUB-200-2011 | AENet | Accuracy | 90.0% | #25 |
| Fine-Grained Image Classification | FGVC Aircraft | AENet | Accuracy | 94.5% | #8 |
| Fine-Grained Image Classification | Stanford Cars | AENet | Accuracy | 94.0% | #47 |
