Panoptic Segmentation
214 papers with code • 24 benchmarks • 32 datasets
Panoptic Segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of a scene. The goal is to partition the image into semantically meaningful regions while also detecting and distinguishing individual object instances within those regions. Every pixel is assigned a semantic label, and pixels belonging to "things" classes (countable objects with instances, such as cars and people) additionally receive unique instance IDs. (Image credit: Detectron2)
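The "semantic label plus instance ID" convention described above can be illustrated with a small sketch. The `LABEL_DIVISOR` packing scheme below (`semantic_id * divisor + instance_id`) is one common encoding used in panoptic pipelines; the class ids and helper names here are made up for illustration:

```python
import numpy as np

# Convention (assumed here): each pixel stores semantic_id * LABEL_DIVISOR + instance_id.
# "Stuff" classes (e.g. road, sky) use instance_id 0; "things" get unique instance ids.
LABEL_DIVISOR = 1000

def encode_panoptic(semantic, instance):
    """Combine per-pixel semantic ids and instance ids into one panoptic map."""
    return semantic.astype(np.int64) * LABEL_DIVISOR + instance

def decode_panoptic(panoptic):
    """Recover (semantic_id, instance_id) from a panoptic map."""
    return panoptic // LABEL_DIVISOR, panoptic % LABEL_DIVISOR

semantic = np.array([[7, 7], [21, 21]])   # hypothetical ids: 7 = road (stuff), 21 = car (thing)
instance = np.array([[0, 0], [1, 2]])     # two distinct car instances
pan = encode_panoptic(semantic, instance)
sem, inst = decode_panoptic(pan)
assert (sem == semantic).all() and (inst == instance).all()
```

The round trip shows why "stuff" pixels share one panoptic id per class while each "thing" instance gets its own.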
Libraries

Use these libraries to find Panoptic Segmentation models and implementations.

Most implemented papers
Dilated Neighborhood Attention Transformer
These models typically employ localized attention mechanisms, such as the sliding-window Neighborhood Attention (NA) or Swin Transformer's Shifted Window Self Attention.
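As a rough illustration of how such localized attention differs from full self-attention, here is a naive 1D sliding-window sketch (no learned projections or multi-head logic, and the border handling is simplified relative to the actual Neighborhood Attention kernel, which keeps the window size constant near edges):

```python
import numpy as np

def neighborhood_attention_1d(x, window=3):
    """Naive sliding-window attention: each token attends only to its
    `window`-sized neighborhood (clamped at the borders), not the full sequence.
    Identity Q/K/V projections are used purely for illustration."""
    n, d = x.shape
    q, k, v = x, x, x
    half = window // 2
    out = np.zeros_like(x)
    for i in range(n):
        lo, hi = max(0, i - half), min(n, i + half + 1)
        scores = q[i] @ k[lo:hi].T / np.sqrt(d)   # scaled dot-product scores
        w = np.exp(scores - scores.max())
        w /= w.sum()                              # softmax over the neighborhood
        out[i] = w @ v[lo:hi]
    return out
```

The cost per token is O(window · d) rather than O(n · d), which is the point of these localized mechanisms.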
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
Datasets drive vision progress, yet existing driving datasets are limited in visual content and in the tasks they support for studying multitask learning in autonomous driving.
Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding
In this technical report, we present two novel datasets for image scene understanding.
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Vision transformers have recently achieved competitive results across various vision tasks but still suffer from heavy computation costs when processing a large number of tokens.
FlexiViT: One Model for All Patch Sizes
Vision Transformers convert images to sequences by slicing them into patches.
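That slicing step can be sketched in a few lines of NumPy; `patchify` below is a hypothetical helper, not FlexiViT's actual code. Note how the patch size directly controls the sequence length, which is the degree of freedom FlexiViT exploits:

```python
import numpy as np

def patchify(image, patch):
    """Slice an (H, W, C) image into a sequence of flattened patch tokens."""
    h, w, c = image.shape
    assert h % patch == 0 and w % patch == 0
    x = image.reshape(h // patch, patch, w // patch, patch, c)
    x = x.transpose(0, 2, 1, 3, 4)            # (grid_h, grid_w, patch, patch, c)
    return x.reshape(-1, patch * patch * c)   # (num_tokens, token_dim)

img = np.arange(4 * 4 * 3, dtype=float).reshape(4, 4, 3)
tokens = patchify(img, 2)
assert tokens.shape == (4, 12)   # 2x2 grid of 2x2x3 patches
assert patchify(img, 4).shape == (1, 48)  # larger patch -> shorter sequence
```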
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
As a result, MaX-DeepLab shows a significant 7.1% PQ gain in the box-free regime on the challenging COCO dataset, closing the gap between box-based and box-free methods for the first time.
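For reference, the PQ (Panoptic Quality) metric cited here averages the IoU of matched predicted/ground-truth segment pairs and penalizes unmatched predictions and ground truths; a minimal per-class sketch (the matching itself, at IoU > 0.5, is assumed done upstream):

```python
def panoptic_quality(matched_ious, num_fp, num_fn):
    """PQ for one class: matched pairs (IoU > 0.5) are true positives.
    PQ = (sum of matched IoUs) / (TP + 0.5 * FP + 0.5 * FN)."""
    tp = len(matched_ious)
    denom = tp + 0.5 * num_fp + 0.5 * num_fn
    return sum(matched_ious) / denom if denom else 0.0

# e.g. two matched segments with IoUs 0.8 and 0.6, one false positive, no false negatives
pq = panoptic_quality([0.8, 0.6], num_fp=1, num_fn=0)  # 1.4 / 2.5 = 0.56
```

The dataset-level PQ averages this quantity over classes.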
Exemplar-Based Open-Set Panoptic Segmentation Network
We extend panoptic segmentation to the open-world and introduce an open-set panoptic segmentation (OPS) task.
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Overall, the proposed mask classification-based method simplifies the landscape of effective approaches to semantic and panoptic segmentation tasks and shows excellent empirical results.
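The mask-classification formulation can be sketched at inference time: the model predicts N query masks, each paired with a class distribution, and per-pixel semantic scores come from combining the two. This mirrors MaskFormer-style semantic inference; the shapes and names below are illustrative:

```python
import numpy as np

def mask_classification_inference(class_logits, mask_logits):
    """Combine N query predictions into a per-pixel semantic map.
    class_logits: (N, num_classes), mask_logits: (N, H, W)."""
    def softmax(x, axis):
        e = np.exp(x - x.max(axis=axis, keepdims=True))
        return e / e.sum(axis=axis, keepdims=True)
    cls = softmax(class_logits, axis=-1)            # (N, num_classes)
    masks = 1.0 / (1.0 + np.exp(-mask_logits))      # (N, H, W), per-query sigmoid masks
    scores = np.einsum('nc,nhw->chw', cls, masks)   # mask-weighted class probabilities
    return scores.argmax(0)                         # (H, W) per-pixel class id
```

Contrast with per-pixel classification, where every pixel independently receives a class distribution with no notion of a segment.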
Finite Scalar Quantization: VQ-VAE Made Simple
Each dimension is quantized to a small set of fixed values, leading to an (implicit) codebook given by the product of these sets.
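A minimal sketch of that quantizer, assuming tanh bounding and `levels[d]` evenly spaced values per dimension (the straight-through gradient used during training is omitted); the implicit codebook size is the product of the levels, e.g. 3 * 5 * 5 = 75 here:

```python
import numpy as np

def fsq(z, levels):
    """Finite scalar quantization sketch: bound each dimension to (-1, 1)
    with tanh, then round it to one of levels[d] evenly spaced values."""
    z = np.tanh(np.asarray(z, dtype=float))      # squash to (-1, 1)
    half = (np.asarray(levels, dtype=float) - 1) / 2
    return np.round(z * half) / half             # quantized, still in [-1, 1]

zq = fsq([0.2, -1.3, 0.9], levels=[3, 5, 5])
# each entry now lies on a fixed per-dimension grid, e.g. {-1, -0.5, 0, 0.5, 1} for 5 levels
```

No learned codebook or commitment loss is needed, which is the "made simple" part relative to VQ-VAE.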
Panoptic Video Scene Graph Generation
PVSG relates to the existing video scene graph generation (VidSGG) problem, which focuses on temporal interactions between humans and objects grounded with bounding boxes in videos.