Image Classification
3784 papers with code • 142 benchmarks • 240 datasets
Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.
Libraries
Use these libraries to find Image Classification models and implementationsDatasets
Subtasks
- Out of Distribution (OOD) Detection
- Few-Shot Image Classification
- Fine-Grained Image Classification
- Semi-Supervised Image Classification
- Semi-Supervised Image Classification
- Learning with noisy labels
- Hyperspectral Image Classification
- Self-Supervised Image Classification
- Small Data Image Classification
- Multi-Label Image Classification
- Genre classification
- Sequential Image Classification
- Unsupervised Image Classification
- Efficient ViTs
- Document Image Classification
- Satellite Image Classification
- Sparse Representation-based Classification
- Photo geolocation estimation
- Image Classification with Differential Privacy
- Token Reduction
- Superpixel Image Classification
- Classification Consistency
- Gallbladder Cancer Detection
- Artistic style classification
- Artist classification
- Temporal Metadata Manipulation Detection
- Misclassification Rate - Natural Adversarial Samples
- Scale Generalisation
Latest papers
CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models
A promising approach towards designing useful task specific explanations with domain experts is based on compositionality of semantic concepts.
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification
This paper presents an innovative disjoint sampling approach for training SOTA models on Hyperspectral image classification (HSIC) tasks.
Pyramid Hierarchical Transformer for Hyperspectral Image Classification
The traditional Transformer model encounters challenges with variable-length input sequences, particularly in Hyperspectral Image Classification (HSIC), leading to efficiency and scalability concerns.
Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification
Hyperspectral image classification is a challenging task due to the high dimensionality and complex nature of hyperspectral data.
3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification
Furthermore, to have high classification performance, there should be a strong interaction between the HSI token and the class (CLS) token.
Next Generation Loss Function for Image Classification
The 5 best functions found were evaluated for different model architectures on a set of standard datasets ranging from 2 to 102 classes and very different sizes.
Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
In this paper, we question if the extremely simple ViTs' fine-tuning performance with a small-scale architecture can also benefit from this pre-training paradigm, which is considerably less studied yet in contrast to the well-established lightweight architecture design methodology with sophisticated components introduced.
InfoMatch: Entropy Neural Estimation for Semi-Supervised Image Classification
Semi-supervised image classification, leveraging pseudo supervision and consistency regularization, has demonstrated remarkable success.
Vocabulary-free Image Classification and Semantic Segmentation
To address VIC, we propose Category Search from External Databases (CaSED), a training-free method that leverages a pre-trained vision-language model and an external database.
A variable metric proximal stochastic gradient method: an application to classification problems
To control the variance of the objective's gradients, we use an automatic sample size selection along with a variable metric to precondition the stochastic gradient directions.