Image Classification
3772 papers with code • 168 benchmarks • 240 datasets
Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.
Libraries
Use these libraries to find Image Classification models and implementationsDatasets
Subtasks
- Out of Distribution (OOD) Detection
- Few-Shot Image Classification
- Fine-Grained Image Classification
- Semi-Supervised Image Classification
- Semi-Supervised Image Classification
- Learning with noisy labels
- Hyperspectral Image Classification
- Self-Supervised Image Classification
- Small Data Image Classification
- Multi-Label Image Classification
- Genre classification
- Sequential Image Classification
- Unsupervised Image Classification
- Efficient ViTs
- Document Image Classification
- Satellite Image Classification
- Sparse Representation-based Classification
- Photo geolocation estimation
- Image Classification with Differential Privacy
- Token Reduction
- Superpixel Image Classification
- Classification Consistency
- Gallbladder Cancer Detection
- Artistic style classification
- Artist classification
- Temporal Metadata Manipulation Detection
- Misclassification Rate - Natural Adversarial Samples
- Scale Generalisation
Latest papers with no code
Coreset Selection for Object Detection
Coreset selection is a method for selecting a small, representative subset of an entire dataset.
Exploring Explainability in Video Action Recognition
To address these, we introduce Video-TCAV, by building on TCAV for Image Classification tasks, which aims to quantify the importance of specific concepts in the decision-making process of Video Action Recognition models.
PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification
The other is to classification on feature distribution of visual tokens from vision encoder.
A Survey of Neural Network Robustness Assessment in Image Recognition
We investigate the perturbation metrics and range representations used to measure the degree of perturbations on images, as well as the robustness metrics specifically for the robustness conditions of classification models.
SpectralMamba: Efficient Mamba for Hyperspectral Image Classification
Recurrent neural networks and Transformers have recently dominated most applications in hyperspectral (HS) imaging, owing to their capability to capture long-range dependencies from spectrum sequences.
Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example
To address the challenges of dependence on pathologists expertise and the time-consuming nature of achieving accurate breast pathological image classification, this paper introduces an approach utilizing convolutional neural networks (CNNs) for the rapid categorization of pathological images, aiming to enhance the efficiency of breast pathological image detection.
Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling
Downsampling operators break the shift invariance of convolutional neural networks (CNNs) and this affects the robustness of features learned by CNNs when dealing with even small pixel-level shift.
Learning to Classify New Foods Incrementally Via Compressed Exemplars
Therefore, food image classification systems should adapt to and manage data that continuously evolves.
CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models
Computer-aided diagnosis systems hold great promise to aid radiologists and clinicians in radiological clinical practice and enhance diagnostic accuracy and efficiency.
Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark
This method aims to mitigate forgetting while adapting to new classes and domain shifts by combining the advantages of the Replay and Pseudo-Label methods and solving their limitations in the proposed scenario.