Instance Segmentation

960 papers with code • 25 benchmarks • 82 datasets

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Benchmarks

Add a Result

These leaderboards are used to track progress in Instance Segmentation

Dataset	Best Model	Compare
COCO test-dev	EVA	See all
COCO minival	InternImage-H	See all
LVIS v1.0 val	Co-DETR (single-scale)	See all
Cityscapes val	OneFormer (InternImage-H, emb_dim=256, single-scale)	See all
ADE20K val	OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained)	See all
Cityscapes test	Deep Watershed Transform	See all
Occluded COCO	Swin-B + Cascade Mask R-CNN (tri-layer modelling)	See all
Separated COCO	Swin-B + Cascade Mask R-CNN (tri-layer modelling)	See all
iSAID	PANet++	See all
TBBR	Swin-T (ImageNet-1k pretrain)	See all
COCO 2017 val	SparK (ConvNeXt V1-B Mask R-CNN)	See all
BDD100K val	Mask Transfiner	See all
COCO val (panoptic labels)	OneFormer (InternImage-H, emb_dim=1024, single-scale)	See all
UIIS	WaterMask RCNN	See all
NYU Depth v2	SGPN-CNN	See all
KINS	BCNet	See all
nuScenes	TraDeS	See all
coco minval	R3-CNN (ResNet-50-FPN, GC-Net)	See all
Leaf Segmentation Challenge	LeafMask	See all
iShape	ASIS(baseline)	See all
LVIS v1.0 test-dev	R50-FPN-MaskRCNN-TTA	See all
PartNet	Semantic Segmentation-Assisted Instance Feature Fusion	See all
COCO val2017	MogaNet-S (256x192)	See all
Object Detection on COCO minival	DaViT-T (Mask R-CNN, 36 epochs)	See all
LDD	R^3-CNN	See all

Show all 25 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Instance Segmentation models and implementations

open-mmlab/mmdetection

31 papers

27,708

PaddlePaddle/PaddleDetection

18 papers

12,029

rwightman/pytorch-image-models

17 papers

29,671

huggingface/transformers

6 papers

124,527

See all 20 libraries.

Datasets

Subtasks

Unsupervised Object Segmentation

Amodal Instance Segmentation

Box-supervised Instance Segmentation

Image-level Supervised Instance Segmentation

Unseen Object Instance Segmentation

3D Semantic Instance Segmentation

Open-World Instance Segmentation

Human Instance Segmentation

One-Shot Instance Segmentation

Semi-Supervised Person Instance Segmentation

Point-Supervised Instance Segmentation

Solar Cell Segmentation

Latest papers with no code

Most implemented Social Latest No code

Segment Any 3D Object with Language

no code yet • 2 Apr 2024

In addition, to align the 3D segmentation model with various language instructions and enhance the mask quality, we introduce three types of multimodal associations as supervision.

Paper
Add Code

Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Anthropic Prior Knowledge

no code yet • 1 Apr 2024

Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health.

Paper
Add Code

SUGAR: Pre-training 3D Visual Representations for Robotics

no code yet • 1 Apr 2024

SUGAR employs a versatile transformer-based model to jointly address five pre-training tasks, namely cross-modal knowledge distillation for semantic learning, masked point modeling to understand geometry structures, grasping pose synthesis for object affordance, 3D instance segmentation and referring expression grounding to analyze cluttered scenes.

Paper
Add Code

What is Point Supervision Worth in Video Instance Segmentation?

no code yet • 1 Apr 2024

Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment, and track objects in videos.

Paper
Add Code

Instance-Aware Group Quantization for Vision Transformers

no code yet • 1 Apr 2024

In particular, the distribution of activations for each channel vary drastically according to input instances, making PTQ methods for CNNs inappropriate for ViTs.

Paper
Add Code

Efficient 3D Instance Mapping and Localization with Neural Fields

no code yet • 28 Mar 2024

The first phase, InstanceMap, takes as input 2D segmentation masks of the image sequence generated by a frontend instance segmentation model, and associates corresponding masks across images to 3D labels.

Paper
Add Code

Annolid: Annotate, Segment, and Track Anything You Need

no code yet • 27 Mar 2024

Annolid is a deep learning-based software package designed for the segmentation, labeling, and tracking of research targets within video files, focusing primarily on animal behavior analysis.

Paper
Add Code

GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation

no code yet • 25 Mar 2024

To this end, we propose a novel framework, called GoodSAM, that introduces a teacher assistant (TA) to provide semantic information, integrated with SAM to generate ensemble logits to achieve knowledge transfer.

Paper
Add Code

AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans

no code yet • 24 Mar 2024

To this end, we construct a learning framework consisting of two components: (1) a pseudo-annotation scheme for generating initial unsupervised pseudo-labels; and (2) a self-training algorithm for instance segmentation to fit robust, accurate instances from initial noisy proposals.

Paper
Add Code

Language-Based Depth Hints for Monocular Depth Estimation

no code yet • 22 Mar 2024

In this work, we demonstrate the use of natural language as a source of an explicit prior about the structure of the world.

Paper
Add Code

Instance Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result