Fine-Grained Image Classification

173 papers with code • 35 benchmarks • 36 datasets

Fine-Grained Image Classification is a task in computer vision where the goal is to classify images into subcategories within a larger category. For example, classifying different species of birds or different types of flowers. This task is considered to be fine-grained because it requires the model to distinguish between subtle differences in visual appearance and patterns, making it more challenging than regular image classification tasks.

( Image credit: Looking for the Devil in the Details )

Benchmarks

Add a Result

These leaderboards are used to track progress in Fine-Grained Image Classification

Dataset	Best Model	Compare
Stanford Cars	CMAL-Net	See all
CUB-200-2011	HERBS	See all
FGVC Aircraft	SR-GNN	See all
Oxford 102 Flowers	VIT-L/16 (Background)	See all
CUB-200-2011	HERBS	See all
NABirds	MetaFormer (MetaFormer-2,384)	See all
Oxford-IIIT Pet Dataset	OmniVec	See all
Stanford Dogs	SR-GNN	See all
Food-101	CAP	See all
Caltech-101	VIT-L/16	See all
Oxford-IIIT Pets	EffNet-L2 (SAM)	See all
CompCars	ResNet101-swp	See all
Birdsnap	EffNet-L2 (SAM)	See all
Bird-225	WideResNet-101 (Spinal FC)	See all
SUN397	µ2Net (ViT-L/16)	See all
10 Monkey Species	Inception-v3 (Spinal FC)	See all
Fruits-360	ResNeXt-101	See all
FoodX-251	CSWin-L	See all
Imbalanced CUB-200-2011	PC-Softmax	See all
SOP	Assemble-ResNet-FGVC-50	See all
Con-Text	PHOC descriptor + Fisher Vector Encoding	See all
Bottles	PHOC descriptor + Fisher Vector Encoding	See all
MNIST	Vanilla FC layer only	See all
EMNIST-Digits	VGG-5	See all
EMNIST-Letters	VGG-5	See all
QMNIST	VGG-5	See all
Kuzushiji-MNIST	VGG-5	See all
STL-10	Pre trained wide-resnet-101	See all
BoxCars116K	ResNet152 + COOC	See all
CarFlag-1532	ResNet101-swp	See all
CarFlag-563	ResNet101-swp	See all
iNaturalist	TASN	See all
FGVC-Aircraft	EnGraf-Net101 (G=4, H=1)	See all
Herbarium 2021 Half–Earth	Conviformer-B	See all
Herbarium 2022	Conviformer-B	See all

Show all 35 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Fine-Grained Image Classification models and implementations

rwightman/pytorch-image-models

7 papers

29,789

open-mmlab/mmclassification

4 papers

3,160

osmr/imgclsmob

4 papers

2,917

Westlake-AI/openmixup

4 papers

572

See all 25 libraries.

Datasets

Subtasks

Displaced People Recognition

Most implemented papers

Most implemented Social Latest No code

See Better Before Looking Closer: Weakly Supervised Data Augmentation Network for Fine-Grained Visual Classification

wvinzh/WS_DAN_PyTorch • • 26 Jan 2019

Specifically, for each training image, we first generate attention maps to represent the object's discriminative parts by weakly supervised learning.

Paper
Code

Presence-Only Geographical Priors for Fine-Grained Image Classification

gengchenmai/space2vec • • ICCV 2019

Appearance information alone is often not sufficient to accurately differentiate between fine-grained visual categories.

Paper
Code

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

kakaobrain/coyo-dataset • • 11 Feb 2021

In this paper, we leverage a noisy dataset of over one billion image alt-text pairs, obtained without expensive filtering or post-processing steps in the Conceptual Captions dataset.

Paper
Code

ImageNet-21K Pretraining for the Masses

Alibaba-MIIL/ImageNet21K • • 22 Apr 2021

ImageNet-1K serves as the primary dataset for pretraining deep learning models for computer vision tasks.

Paper
Code

With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations

lightly-ai/lightly • • ICCV 2021

On semi-supervised learning benchmarks we improve performance significantly when only 1% ImageNet labels are available, from 53. 8% to 56. 5%.

Paper
Code

A Large-Scale Car Dataset for Fine-Grained Categorization and Verification

duongttr/SWP • • CVPR 2015

Updated on 24/09/2015: This update provides preliminary experiment results for fine-grained classification on the surveillance data of CompCars.

Paper
Code

Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition

Jianlong-Fu/Multi-Attention-CNN • ICCV 2017

Two losses are proposed to guide the multi-task learning of channel grouping and part classification, which encourages MA-CNN to generate more discriminative parts from feature channels and learn better fine-grained features from parts in a mutual reinforced way.

Paper
Code

Fixing the train-test resolution discrepancy

facebookresearch/FixRes • • NeurIPS 2019

Conversely, when training a ResNeXt-101 32x48d pre-trained in weakly-supervised fashion on 940 million public images at resolution 224x224 and further optimizing for test resolution 320x320, we obtain a test top-1 accuracy of 86. 4% (top-5: 98. 0%) (single-crop).

Paper
Code

Are These Birds Similar: Learning Branched Networks for Fine-grained Representations

nicolalandro/ntsnet-cub200 • • 16 Jan 2020

In recent years, natural language descriptions are used to obtain information on discriminative parts of the object.

Paper
Code

The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

dongliangchang/Mutual-Channel-Loss • • 11 Feb 2020

The proposed loss function, termed as mutual-channel loss (MC-Loss), consists of two channel-specific components: a discriminality component and a diversity component.

Paper
Code

Fine-Grained Image Classification

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result