71 papers with code • 32 benchmarks • 26 datasets
The Fine-Grained Image Classification task focuses on differentiating between hard-to-distinguish object classes, such as species of birds, flowers, or animals, or the makes and models of vehicles.
(Image credit: Looking for the Devil in the Details)
Deep residual nets form the foundation of our submissions to the ILSVRC & COCO 2015 competitions, where we also won 1st place on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.
Ranked #2 on Semantic Object Interaction Classification on VLOG
In our implementation, we have designed a search space where a policy consists of many sub-policies, one of which is randomly chosen for each image in each mini-batch.
Ranked #5 on Fine-Grained Image Classification on Caltech-101
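The per-image sub-policy sampling described in the AutoAugment abstract above is straightforward to sketch. Below is a minimal, hypothetical Python/torchvision version; the operation names, probabilities, and magnitudes are illustrative placeholders, not the learned policy from the paper.

```python
import random
import torchvision.transforms.functional as TF

# Hypothetical policy in the AutoAugment style: each sub-policy is a list of
# (operation, probability, magnitude) triples. Values here are placeholders,
# not the policy found by the paper's search.
SUB_POLICIES = [
    [("rotate", 0.7, 15), ("contrast", 0.3, 1.4)],
    [("brightness", 0.8, 1.2), ("rotate", 0.2, -10)],
]

OPS = {
    "rotate": lambda img, mag: TF.rotate(img, angle=mag),
    "contrast": lambda img, mag: TF.adjust_contrast(img, contrast_factor=mag),
    "brightness": lambda img, mag: TF.adjust_brightness(img, brightness_factor=mag),
}

def apply_policy(img):
    """Pick one sub-policy at random (per image, per mini-batch) and apply
    each of its operations with the stored probability."""
    sub_policy = random.choice(SUB_POLICIES)
    for op_name, prob, magnitude in sub_policy:
        if random.random() < prob:
            img = OPS[op_name](img, magnitude)
    return img
```

In the actual method, the sub-policies themselves are found by a reinforcement-learning search over the augmentation space rather than written by hand.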
While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited.
Ranked #1 on Image Classification on Tiny ImageNet Classification (using extra training data)
We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification.
Ranked #5 on Image Classification on ImageNet V2 (using extra training data)
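As a rough illustration of what "built entirely upon multi-layer perceptrons" means, here is a minimal PyTorch sketch of a ResMLP-style block: one linear layer mixes information across patches, a two-layer MLP mixes it across channels, and each step has a residual connection. The paper replaces LayerNorm with a simpler learned affine transform; LayerNorm is used here for brevity.

```python
import torch
import torch.nn as nn

class ResMLPBlock(nn.Module):
    """Minimal sketch of a ResMLP-style block (simplified: LayerNorm
    stands in for the paper's learned affine normalization)."""
    def __init__(self, num_patches: int, dim: int, expansion: int = 4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        # Linear layer mixing information *across patches* (applied per channel).
        self.cross_patch = nn.Linear(num_patches, num_patches)
        self.norm2 = nn.LayerNorm(dim)
        # Two-layer MLP mixing information *across channels* (applied per patch).
        self.cross_channel = nn.Sequential(
            nn.Linear(dim, expansion * dim),
            nn.GELU(),
            nn.Linear(expansion * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_patches, dim)
        y = self.norm1(x).transpose(1, 2)            # (batch, dim, num_patches)
        x = x + self.cross_patch(y).transpose(1, 2)  # residual patch mixing
        x = x + self.cross_channel(self.norm2(x))    # residual channel mixing
        return x

# e.g. 196 patches (14x14) with 384-dim embeddings:
block = ResMLPBlock(num_patches=196, dim=384)
out = block(torch.randn(2, 196, 384))  # -> (2, 196, 384)
```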
In this work, we introduce a series of architecture modifications that aim to boost neural networks' accuracy, while retaining their GPU training and inference efficiency.
Ranked #6 on Fine-Grained Image Classification on Oxford 102 Flowers (using extra training data)
Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available.
Ranked #2 on Fine-Grained Image Classification on Birdsnap (using extra training data)
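The compound scaling rule this (EfficientNet) abstract alludes to scales network depth, width, and input resolution jointly with a single coefficient, rather than scaling any one dimension alone. A minimal sketch, using the alpha, beta, gamma constants reported in the paper:

```python
def compound_scale(phi: float, alpha: float = 1.2, beta: float = 1.1,
                   gamma: float = 1.15):
    """Compound scaling from the EfficientNet paper: depth, width, and input
    resolution are scaled together by a single coefficient phi, with
    alpha * beta**2 * gamma**2 ~= 2 so FLOPs grow roughly by 2**phi."""
    depth_mult = alpha ** phi       # scales the number of layers
    width_mult = beta ** phi        # scales the number of channels
    resolution_mult = gamma ** phi  # scales the input image resolution
    return depth_mult, width_mult, resolution_mult

# phi = 1 roughly doubles FLOPs relative to the baseline (EfficientNet-B0):
print(compound_scale(1.0))  # (1.2, 1.1, 1.15)
```

In the paper, the baseline network (B0) is found by neural architecture search and then scaled with this rule to obtain the B1 through B7 variants.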
In this paper, we point out that the attention inside these local patches is also essential for building visual transformers with high performance, and we explore a new architecture, namely Transformer iN Transformer (TNT).
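A minimal PyTorch sketch of the two-level attention TNT describes: an inner transformer attends over sub-patch ("pixel") embeddings within each patch, and its output is projected into the patch embedding before an outer transformer attends across patches. The dimensions and the use of nn.TransformerEncoderLayer are illustrative simplifications, not the paper's exact implementation.

```python
import torch
import torch.nn as nn

class TNTBlock(nn.Module):
    """Sketch of a Transformer-iN-Transformer block: inner attention
    within each patch, outer attention across patches."""
    def __init__(self, patch_dim=384, pixel_dim=24, pixels_per_patch=16):
        super().__init__()
        self.inner = nn.TransformerEncoderLayer(
            d_model=pixel_dim, nhead=4, batch_first=True)
        self.outer = nn.TransformerEncoderLayer(
            d_model=patch_dim, nhead=6, batch_first=True)
        # Project flattened pixel-level features into the patch embedding.
        self.proj = nn.Linear(pixels_per_patch * pixel_dim, patch_dim)

    def forward(self, patch_tokens, pixel_tokens):
        # patch_tokens: (batch, num_patches, patch_dim)
        # pixel_tokens: (batch * num_patches, pixels_per_patch, pixel_dim)
        b, n, _ = patch_tokens.shape
        pixel_tokens = self.inner(pixel_tokens)          # attention inside patches
        fused = self.proj(pixel_tokens.flatten(1)).reshape(b, n, -1)
        patch_tokens = self.outer(patch_tokens + fused)  # attention across patches
        return patch_tokens, pixel_tokens

# e.g. 196 patches, each split into 16 sub-patch embeddings:
pt, px = torch.randn(2, 196, 384), torch.randn(2 * 196, 16, 24)
out_patches, out_pixels = TNTBlock()(pt, px)
```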
In this work, we produce a competitive convolution-free transformer by training on ImageNet only.
Ranked #2 on Image Classification on iNaturalist 2018
Vision Transformers (ViTs) and MLPs signal further efforts to replace hand-wired features or inductive biases with general-purpose neural architectures.
Ranked #1 on Domain Generalization on ImageNet-R (Top 1 Accuracy metric)
Scaling up deep neural network capacity has been known as an effective approach to improving model quality for several different machine learning tasks.
Ranked #4 on Fine-Grained Image Classification on Birdsnap (using extra training data)