Fine-Grained Image Classification

173 papers with code • 35 benchmarks • 36 datasets

Fine-Grained Image Classification is a computer vision task in which the goal is to classify images into subcategories within a larger category, for example distinguishing different species of birds or different types of flowers. The task is considered fine-grained because the model must distinguish subtle differences in visual appearance and patterns, which makes it more challenging than regular image classification.

(Image credit: Looking for the Devil in the Details)
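The gap between coarse and fine-grained classification can be illustrated with a minimal sketch. The class names, the `FINE_TO_COARSE` mapping, and the predictions below are hypothetical toy values, not data from any benchmark listed on this page. The sketch scores the same predictions once at the subcategory level and once at the parent-category level.

```python
# Hypothetical toy hierarchy: each fine-grained class maps to one coarse category.
FINE_TO_COARSE = {
    "indigo_bunting": "bird",
    "blue_grosbeak": "bird",
    "sunflower": "flower",
    "daisy": "flower",
}


def accuracy(y_true, y_pred):
    """Return the fraction of predictions that exactly match the labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one label is required")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def coarse_accuracy(y_true, y_pred, mapping=FINE_TO_COARSE):
    """Score predictions after collapsing every label to its parent category."""
    return accuracy([mapping[t] for t in y_true], [mapping[p] for p in y_pred])


# Made-up labels and predictions: the model confuses visually similar subcategories.
y_true = ["indigo_bunting", "blue_grosbeak", "sunflower", "daisy"]
y_pred = ["blue_grosbeak", "blue_grosbeak", "sunflower", "sunflower"]

fine_acc = accuracy(y_true, y_pred)           # subcategory-level score
coarse_acc = coarse_accuracy(y_true, y_pred)  # parent-category-level score

print(f"fine-grained accuracy: {fine_acc:.2f}")
print(f"coarse accuracy:       {coarse_acc:.2f}")
```

In this toy case every prediction lands in the correct parent category, yet half of them pick the wrong subcategory, which is the kind of error fine-grained benchmarks are meant to expose.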

Benchmarks

These leaderboards are used to track progress in Fine-Grained Image Classification.

Dataset	Best Model
Stanford Cars	CMAL-Net
CUB-200-2011	HERBS
FGVC Aircraft	SR-GNN
Oxford 102 Flowers	VIT-L/16 (Background)
CUB-200-2011	HERBS
NABirds	MetaFormer (MetaFormer-2,384)
Stanford Dogs	SR-GNN
Oxford-IIIT Pet Dataset	OmniVec
Food-101	CAP
Caltech-101	VIT-L/16
Oxford-IIIT Pets	EffNet-L2 (SAM)
CompCars	ResNet101-swp
Birdsnap	EffNet-L2 (SAM)
Bird-225	WideResNet-101 (Spinal FC)
SUN397	µ2Net (ViT-L/16)
10 Monkey Species	Inception-v3 (Spinal FC)
Fruits-360	ResNeXt-101
FoodX-251	CSWin-L
Imbalanced CUB-200-2011	PC-Softmax
SOP	Assemble-ResNet-FGVC-50
Con-Text	PHOC descriptor + Fisher Vector Encoding
Bottles	PHOC descriptor + Fisher Vector Encoding
MNIST	Vanilla FC layer only
EMNIST-Digits	VGG-5
EMNIST-Letters	VGG-5
QMNIST	VGG-5
Kuzushiji-MNIST	VGG-5
STL-10	Pre trained wide-resnet-101
BoxCars116K	ResNet152 + COOC
CarFlag-1532	ResNet101-swp
CarFlag-563	ResNet101-swp
iNaturalist	TASN
FGVC-Aircraft	EnGraf-Net101 (G=4, H=1)
Herbarium 2021 Half–Earth	Conviformer-B
Herbarium 2022	Conviformer-B

Libraries

Use these libraries to find Fine-Grained Image Classification models and implementations.

rwightman/pytorch-image-models • 7 papers • 29,949

open-mmlab/mmclassification • 4 papers • 3,188

osmr/imgclsmob • 4 papers • 2,924

Westlake-AI/openmixup • 4 papers • 576

See all 25 libraries.

Datasets

Subtasks

Displaced People Recognition

Most implemented papers

TResNet: High Performance GPU-Dedicated Architecture

rwightman/pytorch-image-models • 30 Mar 2020

In this work, we introduce a series of architecture modifications that aim to boost neural networks' accuracy, while retaining their GPU training and inference efficiency.

Proxy Anchor Loss for Deep Metric Learning

tjddus9597/Proxy-Anchor-CVPR2020 • CVPR 2020

Pair-based metric learning losses can leverage fine-grained semantic relations between data points, but they generally slow convergence because of their high training complexity.

SpinalNet: Deep Neural Network with Gradual Input

dipuk0506/SpinalNet • arXiv 2020

Traditional learning with ImageNet pre-trained initial weights and SpinalNet classification layers provided the SOTA performance on STL-10, Fruits 360, Bird225, and Caltech-101 datasets.

Roll With the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning

njuyued/soc4ss-fgvc • 19 Dec 2023

While semi-supervised learning (SSL) has yielded promising results, the more realistic SSL scenario remains to be explored, in which the unlabeled data exhibits extremely high recognition difficulty, e.g., fine-grained visual classification in the context of SSL (SS-FGVC).

Evaluation of Output Embeddings for Fine-Grained Image Classification

Image classification has advanced significantly in recent years with the availability of large-scale image sets.

Destruction and Construction Learning for Fine-Grained Image Recognition

JDAI-CV/DCL • CVPR 2019

In this paper, we propose a novel "Destruction and Construction Learning" (DCL) method to enhance the difficulty of fine-grained recognition and exercise the classification model to acquire expert knowledge.

Classification-Specific Parts for Improving Fine-Grained Visual Categorization

DiKorsch/l1_parts • 16 Sep 2019

Fine-grained visual categorization is a classification task for distinguishing categories with high intra-class and small inter-class variance.

Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization

FlyingMoon-GitHub/ACNet • CVPR 2020

Specifically, we incorporate convolutional operations along edges of the tree structure, and use the routing functions in each node to determine the root-to-leaf computational paths within the tree.

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features

DreadPiratePsyopus/Fine_Grained_Clf • 14 Jan 2020

Text contained in an image carries high-level semantics that can be exploited to achieve richer image understanding.

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

JDAI-CV/LIO • CVPR 2020

Specifically, we first propose an object-extent learning module for localizing the object according to the visual patterns shared among the instances in the same category.
