Instance Segmentation

960 papers with code • 25 benchmarks • 82 datasets

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Libraries

Use these libraries to find Instance Segmentation models and implementations

Latest papers with no code

Segment Any 3D Object with Language

no code yet • 2 Apr 2024

In addition, to align the 3D segmentation model with various language instructions and enhance the mask quality, we introduce three types of multimodal associations as supervision.

Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Anthropic Prior Knowledge

no code yet • 1 Apr 2024

Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health.

SUGAR: Pre-training 3D Visual Representations for Robotics

no code yet • 1 Apr 2024

SUGAR employs a versatile transformer-based model to jointly address five pre-training tasks, namely cross-modal knowledge distillation for semantic learning, masked point modeling to understand geometry structures, grasping pose synthesis for object affordance, 3D instance segmentation and referring expression grounding to analyze cluttered scenes.

What is Point Supervision Worth in Video Instance Segmentation?

no code yet • 1 Apr 2024

Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment, and track objects in videos.

Instance-Aware Group Quantization for Vision Transformers

no code yet • 1 Apr 2024

In particular, the distribution of activations for each channel vary drastically according to input instances, making PTQ methods for CNNs inappropriate for ViTs.

Efficient 3D Instance Mapping and Localization with Neural Fields

no code yet • 28 Mar 2024

The first phase, InstanceMap, takes as input 2D segmentation masks of the image sequence generated by a frontend instance segmentation model, and associates corresponding masks across images to 3D labels.

Annolid: Annotate, Segment, and Track Anything You Need

no code yet • 27 Mar 2024

Annolid is a deep learning-based software package designed for the segmentation, labeling, and tracking of research targets within video files, focusing primarily on animal behavior analysis.

GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation

no code yet • 25 Mar 2024

To this end, we propose a novel framework, called GoodSAM, that introduces a teacher assistant (TA) to provide semantic information, integrated with SAM to generate ensemble logits to achieve knowledge transfer.

AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans

no code yet • 24 Mar 2024

To this end, we construct a learning framework consisting of two components: (1) a pseudo-annotation scheme for generating initial unsupervised pseudo-labels; and (2) a self-training algorithm for instance segmentation to fit robust, accurate instances from initial noisy proposals.

Language-Based Depth Hints for Monocular Depth Estimation

no code yet • 22 Mar 2024

In this work, we demonstrate the use of natural language as a source of an explicit prior about the structure of the world.