Instance Segmentation

960 papers with code • 25 benchmarks • 82 datasets

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Libraries

Use these libraries to find Instance Segmentation models and implementations

NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer

michaelwwan/noise 15 Apr 2024

In the last few years, a handful of machine learning approaches for osteoclast image analysis have been developed, but none have addressed the full instance segmentation task required to produce the same output as that of the human expert led process.

2
15 Apr 2024

ViM-UNet: Vision Mamba for Biomedical Segmentation

faceonlive/ai-research 11 Apr 2024

Here, we introduce ViM-UNet, a novel segmentation architecture based on it and compare it to UNet and UNETR for two challenging microscopy instance segmentation tasks.

131
11 Apr 2024

ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning

clovaai/ECLIPSE 29 Mar 2024

Panoptic segmentation, combining semantic and instance segmentation, stands as a cutting-edge computer vision task.

6
29 Mar 2024

DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs

naver-ai/rdnet 28 Mar 2024

This paper revives Densely Connected Convolutional Networks (DenseNets) and reveals the underrated effectiveness over predominant ResNet-style architectures.

27
28 Mar 2024

PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

chenhongyiyang/plainmamba 26 Mar 2024

In this paper, we further adapt the selective scanning process of Mamba to the visual domain, enhancing its ability to learn features from two-dimensional images by (i) a continuous 2D scanning process that improves spatial continuity by ensuring adjacency of tokens in the scanning sequence, and (ii) direction-aware updating which enables the model to discern the spatial relations of tokens by encoding directional information.

36
26 Mar 2024

Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer

badripatro/sct 26 Mar 2024

Transformers used in vision have been investigated through diverse architectures - ViT, PVT, and Swin.

1
26 Mar 2024

BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation

peoplelu/bsnet 22 Mar 2024

To generate higher quality pseudo-labels and achieve more precise weakly supervised 3DIS results, we propose the Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation (BSNet), which devises a novel pseudo-labeler called Simulation-assisted Transformer.

1
22 Mar 2024

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

vitae-transformer/mtp 20 Mar 2024

However, transferring the pretrained models to downstream tasks may encounter task discrepancy due to their formulation of pretraining as image classification or object discrimination tasks.

81
20 Mar 2024

CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation

zwq456/clip-vis 19 Mar 2024

Given a set of initial queries, class-agnostic mask generation employs a transformer decoder to predict query masks and corresponding object scores and mask IoU scores.

25
19 Mar 2024

Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery

zyqz97/aerial_lifting 18 Mar 2024

We then introduce a novel cross-view instance label grouping strategy based on the 3D scene representation to mitigate the multi-view inconsistency problem in the 2D instance labels.

31
18 Mar 2024