Instance Segmentation
971 papers with code • 25 benchmarks • 83 datasets
Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.
Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21
Libraries
Use these libraries to find Instance Segmentation models and implementationsDatasets
Subtasks
- Referring Expression Segmentation
- 3D Instance Segmentation
- Real-time Instance Segmentation
- Unsupervised Object Segmentation
- Unsupervised Object Segmentation
- Amodal Instance Segmentation
- Box-supervised Instance Segmentation
- Image-level Supervised Instance Segmentation
- Unseen Object Instance Segmentation
- 3D Semantic Instance Segmentation
- Open-World Instance Segmentation
- Human Instance Segmentation
- One-Shot Instance Segmentation
- Semi-Supervised Person Instance Segmentation
- Point-Supervised Instance Segmentation
- Solar Cell Segmentation
Most implemented papers
Is Heuristic Sampling Necessary in Training Deep Object Detectors?
In this paper, we challenge the necessity of such hard/soft sampling methods for training accurate deep object detectors.
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
We propose SpineNet, a backbone with scale-permuted intermediate features and cross-scale connections that is learned on an object detection task by Neural Architecture Search.
Bottleneck Transformers for Visual Recognition
Finally, we present a simple adaptation of the BoTNet design for image classification, resulting in models that achieve a strong performance of 84. 7% top-1 accuracy on the ImageNet benchmark while being up to 1. 64x faster in compute time than the popular EfficientNet models on TPU-v3 hardware.
Efficient Attention: Attention with Linear Complexities
Dot-product attention has wide applications in computer vision and natural language processing.
Panoptic Feature Pyramid Networks
In this work, we perform a detailed study of this minimally extended version of Mask R-CNN with FPN, which we refer to as Panoptic FPN, and show it is a robust and accurate baseline for both tasks.
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks
By dissecting the channel attention module in SENet, we empirically show avoiding dimensionality reduction is important for learning channel attention, and appropriate cross-channel interaction can preserve performance while significantly decreasing model complexity.
UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation
The state-of-the-art models for medical image segmentation are variants of U-Net and fully convolutional networks (FCN).
XCiT: Cross-Covariance Image Transformers
We propose a "transposed" version of self-attention that operates across feature channels rather than tokens, where the interactions are based on the cross-covariance matrix between keys and queries.
Path Aggregation Network for Instance Segmentation
The way that information propagates in neural networks is of great importance.
Key Points Estimation and Point Instance Segmentation Approach for Lane Detection
In the case of traffic line detection, an essential perception module, many condition should be considered, such as number of traffic lines and computing power of the target system.