Instance Segmentation

960 papers with code • 25 benchmarks • 82 datasets

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Benchmarks

Add a Result

These leaderboards are used to track progress in Instance Segmentation

Dataset	Best Model	Compare
COCO test-dev	EVA	See all
COCO minival	InternImage-H	See all
LVIS v1.0 val	Co-DETR (single-scale)	See all
Cityscapes val	OneFormer (InternImage-H, emb_dim=256, single-scale)	See all
ADE20K val	OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained)	See all
Cityscapes test	Deep Watershed Transform	See all
Occluded COCO	Swin-B + Cascade Mask R-CNN (tri-layer modelling)	See all
Separated COCO	Swin-B + Cascade Mask R-CNN (tri-layer modelling)	See all
iSAID	PANet++	See all
TBBR	Swin-T (ImageNet-1k pretrain)	See all
COCO 2017 val	SparK (ConvNeXt V1-B Mask R-CNN)	See all
BDD100K val	Mask Transfiner	See all
COCO val (panoptic labels)	OneFormer (InternImage-H, emb_dim=1024, single-scale)	See all
UIIS	WaterMask RCNN	See all
NYU Depth v2	SGPN-CNN	See all
KINS	BCNet	See all
nuScenes	TraDeS	See all
coco minval	R3-CNN (ResNet-50-FPN, GC-Net)	See all
Leaf Segmentation Challenge	LeafMask	See all
iShape	ASIS(baseline)	See all
LVIS v1.0 test-dev	R50-FPN-MaskRCNN-TTA	See all
PartNet	Semantic Segmentation-Assisted Instance Feature Fusion	See all
COCO val2017	MogaNet-S (256x192)	See all
Object Detection on COCO minival	DaViT-T (Mask R-CNN, 36 epochs)	See all
LDD	R^3-CNN	See all

Show all 25 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Instance Segmentation models and implementations

open-mmlab/mmdetection

31 papers

27,708

PaddlePaddle/PaddleDetection

18 papers

12,029

rwightman/pytorch-image-models

17 papers

29,671

huggingface/transformers

6 papers

124,527

See all 20 libraries.

Datasets

Subtasks

Unsupervised Object Segmentation

Amodal Instance Segmentation

Box-supervised Instance Segmentation

Image-level Supervised Instance Segmentation

Unseen Object Instance Segmentation

3D Semantic Instance Segmentation

Open-World Instance Segmentation

Human Instance Segmentation

One-Shot Instance Segmentation

Semi-Supervised Person Instance Segmentation

Point-Supervised Instance Segmentation

Solar Cell Segmentation

Latest papers

Most implemented Social Latest No code

NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer

michaelwwan/noise • • 15 Apr 2024

In the last few years, a handful of machine learning approaches for osteoclast image analysis have been developed, but none have addressed the full instance segmentation task required to produce the same output as that of the human expert led process.

15 Apr 2024

Paper
Code

ViM-UNet: Vision Mamba for Biomedical Segmentation

faceonlive/ai-research • 11 Apr 2024

Here, we introduce ViM-UNet, a novel segmentation architecture based on it and compare it to UNet and UNETR for two challenging microscopy instance segmentation tasks.

131

11 Apr 2024

Paper
Code

ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning

clovaai/ECLIPSE • 29 Mar 2024

Panoptic segmentation, combining semantic and instance segmentation, stands as a cutting-edge computer vision task.

29 Mar 2024

Paper
Code

DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs

naver-ai/rdnet • • 28 Mar 2024

This paper revives Densely Connected Convolutional Networks (DenseNets) and reveals the underrated effectiveness over predominant ResNet-style architectures.

28 Mar 2024

Paper
Code

PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

chenhongyiyang/plainmamba • • 26 Mar 2024

In this paper, we further adapt the selective scanning process of Mamba to the visual domain, enhancing its ability to learn features from two-dimensional images by (i) a continuous 2D scanning process that improves spatial continuity by ensuring adjacency of tokens in the scanning sequence, and (ii) direction-aware updating which enables the model to discern the spatial relations of tokens by encoding directional information.

26 Mar 2024

Paper
Code

Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer

badripatro/sct • • 26 Mar 2024

Transformers used in vision have been investigated through diverse architectures - ViT, PVT, and Swin.

26 Mar 2024

Paper
Code

BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation

peoplelu/bsnet • • 22 Mar 2024

To generate higher quality pseudo-labels and achieve more precise weakly supervised 3DIS results, we propose the Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation (BSNet), which devises a novel pseudo-labeler called Simulation-assisted Transformer.

22 Mar 2024

Paper
Code

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

vitae-transformer/mtp • • 20 Mar 2024

However, transferring the pretrained models to downstream tasks may encounter task discrepancy due to their formulation of pretraining as image classification or object discrimination tasks.

20 Mar 2024

Paper
Code

CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation

zwq456/clip-vis • • 19 Mar 2024

Given a set of initial queries, class-agnostic mask generation employs a transformer decoder to predict query masks and corresponding object scores and mask IoU scores.

19 Mar 2024

Paper
Code

Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery

zyqz97/aerial_lifting • • 18 Mar 2024

We then introduce a novel cross-view instance label grouping strategy based on the 3D scene representation to mitigate the multi-view inconsistency problem in the 2D instance labels.

18 Mar 2024

Paper
Code

Instance Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result