Instance Segmentation
981 papers with code • 25 benchmarks • 84 datasets
Instance Segmentation is a computer vision task that involves identifying and delineating individual objects within an image: detecting each object's boundaries and assigning it a unique instance label. The goal is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.
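The output described above can be illustrated with a toy example: given per-instance binary masks (the typical model output), they combine into a single pixel-wise instance map. This is a minimal sketch with hypothetical 4x4 masks, not the output format of any specific library.

```python
import numpy as np

# Two hypothetical object instances in a 4x4 image, each given as a
# binary mask, as a typical instance segmentation model would produce.
mask_a = np.array([[1, 1, 0, 0],
                   [1, 1, 0, 0],
                   [0, 0, 0, 0],
                   [0, 0, 0, 0]], dtype=bool)
mask_b = np.array([[0, 0, 0, 0],
                   [0, 0, 1, 1],
                   [0, 0, 1, 1],
                   [0, 0, 0, 0]], dtype=bool)

# Combine per-instance masks into one pixel-wise instance map:
# 0 = background, 1 = first instance, 2 = second instance.
instance_map = np.zeros((4, 4), dtype=np.int32)
for instance_id, mask in enumerate([mask_a, mask_b], start=1):
    instance_map[mask] = instance_id

print(instance_map)
```

Note that, unlike semantic segmentation, two objects of the same class would still receive distinct instance ids in this map.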
Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21
Libraries
Use these libraries to find Instance Segmentation models and implementations.
Subtasks
- Referring Expression Segmentation
- 3D Instance Segmentation
- Real-time Instance Segmentation
- Unsupervised Object Segmentation
- Amodal Instance Segmentation
- Box-supervised Instance Segmentation
- Unseen Object Instance Segmentation
- Image-level Supervised Instance Segmentation
- 3D Semantic Instance Segmentation
- Open-World Instance Segmentation
- Human Instance Segmentation
- One-Shot Instance Segmentation
- Semi-Supervised Person Instance Segmentation
- Point-Supervised Instance Segmentation
- Solar Cell Segmentation
Most implemented papers
Rethinking Channel Dimensions for Efficient Model Design
We then investigate the channel configuration of a model by searching network architectures concerning the channel configuration under the computational cost restriction.
Panoptic Segmentation
We propose and study a task we name panoptic segmentation (PS).
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation.
Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic segmentation, aiming to establish a solid baseline for bottom-up methods that achieves performance comparable to two-stage methods while yielding fast inference speed.
BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
The proposed BlendMask can effectively predict dense per-pixel position-sensitive instance features with very few channels, and learn attention maps for each instance with merely one convolution layer, thus being fast in inference.
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Unlike the recently-proposed Transformer model (e.g., ViT) that is specially designed for image classification, we propose Pyramid Vision Transformer (PVT), which overcomes the difficulties of porting Transformer to various dense prediction tasks.
Co-Scale Conv-Attentional Image Transformers
In this paper, we present Co-scale conv-attentional image Transformers (CoaT), a Transformer-based image classifier equipped with co-scale and conv-attentional mechanisms.
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
In this paper, we present Mask DINO, a unified object detection and segmentation framework.
RTMDet: An Empirical Study of Designing Real-Time Object Detectors
In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO series and is easily extensible for many object recognition tasks such as instance segmentation and rotated object detection.
Semantic Instance Segmentation with a Discriminative Loss Function
In this work we propose to tackle the problem with a discriminative loss function, operating at the pixel level, that encourages a convolutional network to produce a representation of the image that can easily be clustered into instances with a simple post-processing step.
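The pull/push structure of such a discriminative loss can be sketched in a few lines: a variance term pulls each pixel embedding toward its instance mean, a distance term pushes instance means apart, and a small regularizer keeps means near the origin. This is a minimal numpy sketch of that structure, not the paper's reference implementation; the margins `delta_v` and `delta_d` and the regularizer weight are illustrative values.

```python
import numpy as np

def discriminative_loss(embeddings, labels, delta_v=0.5, delta_d=1.5):
    """Pull/push loss over per-pixel embeddings (N, D) with per-pixel
    instance labels (N,). Every label is treated as an instance here;
    a real implementation would usually exclude background."""
    instance_ids = np.unique(labels)
    means = np.stack([embeddings[labels == i].mean(axis=0)
                      for i in instance_ids])

    # Variance (pull) term: draw embeddings toward their instance mean,
    # penalizing only distances beyond the margin delta_v.
    l_var = 0.0
    for mu, i in zip(means, instance_ids):
        dists = np.linalg.norm(embeddings[labels == i] - mu, axis=1)
        l_var += np.mean(np.maximum(dists - delta_v, 0.0) ** 2)
    l_var /= len(instance_ids)

    # Distance (push) term: push instance means at least 2*delta_d apart.
    l_dist = 0.0
    C = len(instance_ids)
    if C > 1:
        for a in range(C):
            for b in range(C):
                if a != b:
                    d = np.linalg.norm(means[a] - means[b])
                    l_dist += np.maximum(2 * delta_d - d, 0.0) ** 2
        l_dist /= C * (C - 1)

    # Regularization: keep instance means close to the origin.
    l_reg = np.mean(np.linalg.norm(means, axis=1))

    return l_var + l_dist + 0.001 * l_reg
```

After training with such a loss, instances are recovered by clustering the embeddings, e.g. with mean-shift, as the simple post-processing step the summary mentions.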