Panoptic Segmentation

213 papers with code • 24 benchmarks • 32 datasets

Panoptic segmentation is a computer vision task that unifies semantic segmentation and instance segmentation into a single, complete parse of a scene. Every pixel in an image is assigned a semantic label, and pixels belonging to "things" classes (countable objects such as cars and people) are additionally assigned a unique instance ID, so that individual objects can be told apart. Pixels belonging to "stuff" classes (amorphous regions such as sky or road) carry only the semantic label. (Image credit: Detectron2)
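To make the labeling scheme concrete, here is a minimal sketch of how a per-pixel semantic map and a per-pixel instance map could be merged into one panoptic map. The class IDs, the `THING_CLASSES` set, the `LABEL_DIVISOR` value, and both function names are assumptions made for illustration; they are not taken from any particular dataset or library. Pixels of non-"things" classes (often called "stuff", such as road or sky) keep only their semantic label.

```python
# Hypothetical class ids for illustration only; not from any real dataset.
THING_CLASSES = {1, 2}   # e.g. 1 = car, 2 = person (countable "things")
LABEL_DIVISOR = 1000     # assumed encoding: class_id * divisor + instance_id


def merge_to_panoptic(semantic, instance):
    """Combine per-pixel semantic labels and instance ids into one map.

    semantic: 2D list of class ids, one per pixel.
    instance: 2D list of instance ids (> 0 on "things" pixels, 0 elsewhere).
    """
    panoptic = []
    for sem_row, inst_row in zip(semantic, instance):
        row = []
        for cls, inst in zip(sem_row, inst_row):
            # Non-"things" pixels keep only the semantic label (instance part 0).
            inst_part = inst if cls in THING_CLASSES else 0
            row.append(cls * LABEL_DIVISOR + inst_part)
        panoptic.append(row)
    return panoptic


def decode(panoptic_id):
    """Split an encoded panoptic id back into (class id, instance id)."""
    return divmod(panoptic_id, LABEL_DIVISOR)


# Toy 3x4 image: class 0 is background, with two cars and one person.
semantic = [
    [0, 0, 1, 1],
    [0, 2, 1, 1],
    [0, 2, 0, 1],
]
instance = [
    [0, 0, 1, 1],
    [0, 1, 1, 1],
    [0, 1, 0, 2],
]
panoptic = merge_to_panoptic(semantic, instance)
```

In this toy image the two cars share the semantic class 1 but receive different encoded values because their instance IDs differ, while every background pixel maps to the same value. Real datasets and toolkits each define their own encoding, so the divisor scheme above should be read as one possible convention.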

Benchmarks

These leaderboards track progress in panoptic segmentation.

Dataset	Best Model
COCO test-dev	Mask DINO (single scale)
Cityscapes val	OneFormer (ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained)
COCO minival	OneFormer (InternImage-H, emb_dim=1024, single-scale)
ADE20K val	OneFormer (InternImage-H, emb_dim=256, single-scale, 896x896)
Mapillary val	OneFormer (DiNAT-L, single-scale)
Cityscapes test	OneFormer (ConvNeXt-L, single-scale, Mapillary Vistas-pretrained)
LaRS	Mask2Former (Swin-B)
S3DIS Area5	SuperCluster
KITTI Panoptic Segmentation	EfficientPS
Indian Driving Dataset	EfficientPS
ScanNetV2	OneFormer3D
ScanNet	OneFormer3D
PASTIS	Exchanger+Mask2Former
SemanticKITTI	P3Former
PanNuke	CellViT-SAM-H
COCO panoptic	VAN-B6*
NYU Depth v2	EMSANet
SUN-RGBD	EMSANet
Panoptic nuScenes val	PolarSeg-Panoptic
Panoptic nuScenes test	(AF)2-S3Net + CenterPoint
PASTIS-R	Early Fusion
S3DIS	SuperCluster
KITTI-360	SuperCluster
DALES	SuperCluster

Libraries

Use these libraries to find Panoptic Segmentation models and implementations

open-mmlab/mmdetection • 9 papers • 27,708 stars
huggingface/transformers • 7 papers • 124,527 stars
google-research/deeplab2 • 7 papers • 982 stars
PaddlePaddle/PaddleDetection • 5 papers • 12,029 stars

See all 15 libraries.

Subtasks

Video Panoptic Segmentation

Most implemented papers

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

sacmehta/ESPNet • ECCV 2018

We introduce a fast and efficient convolutional neural network, ESPNet, for semantic segmentation of high resolution images under resource constraints.

CenterMask : Real-Time Anchor-Free Instance Segmentation

youngwanLEE/CenterMask • arXiv 2019

We hope that CenterMask and VoVNetV2 can serve as a solid baseline of real-time instance segmentation and backbone network for various vision tasks, respectively.

Hierarchical Multi-Scale Attention for Semantic Segmentation

NVIDIA/semantic-segmentation • 21 May 2020

Multi-scale inference is commonly used to improve the results of semantic segmentation.

DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution

joe-siyuan-qiao/DetectoRS • CVPR 2021

In this paper, we explore this mechanism in the backbone design for object detection.

Fully Convolutional Networks for Panoptic Segmentation

yanwei-li/PanopticFCN • CVPR 2021

In this paper, we present a conceptually simple, strong, and efficient framework for panoptic segmentation, called Panoptic FCN.

Masked-attention Mask Transformer for Universal Image Segmentation

facebookresearch/Mask2Former • CVPR 2022

While only the semantics of each task differ, current research focuses on designing specialized architectures for each task.

Focal Modulation Networks

microsoft/FocalNet • 22 Mar 2022

For semantic segmentation with UPerNet, FocalNet base at single-scale outperforms Swin by 2.4, and beats Swin at multi-scale (50.5 vs. …

Seamless Scene Segmentation

mapillary/seamseg • CVPR 2019

In this work we introduce a novel, CNN-based architecture that can be trained end-to-end to deliver seamless scene segmentation results.

Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation

google-research/deeplab2 • ECCV 2020

In this paper, we attempt to remove this constraint by factorizing 2D self-attention into two 1D self-attentions.

Mask2Former for Video Instance Segmentation

facebookresearch/Mask2Former • 20 Dec 2021

We find Mask2Former also achieves state-of-the-art performance on video instance segmentation without modifying the architecture, the loss or even the training pipeline.
