Semantic Segmentation

5335 papers with code • 126 benchmarks • 317 datasets

Semantic Segmentation is a computer vision task in which the goal is to categorize each pixel in an image into a class or object. The goal is to produce a dense pixel-wise segmentation map of an image, where each pixel is assigned to a specific class or object. Some example benchmarks for this task are Cityscapes, PASCAL VOC and ADE20K. Models are usually evaluated with the Mean Intersection-Over-Union (Mean IoU) and Pixel Accuracy metrics.

( Image credit: CSAILVision )

Benchmarks

Add a Result

These leaderboards are used to track progress in Semantic Segmentation

Dataset	Best Model	Compare
ADE20K	ONE-PEACE	See all
NYU Depth v2	OmniVec	See all
Cityscapes test	VLTSeg	See all
ADE20K val	BEiT-3	See all
Cityscapes val	SERNet-Former	See all
PASCAL Context	PlainSeg (EVA-02-L)	See all
S3DIS	PTv3 + PPT	See all
S3DIS Area5	OmniVec	See all
PASCAL VOC 2012 test	DeepLabv3+ (Xception-65-JFT)	See all
SUN-RGBD	TokenFusion (S)	See all
DensePASS	Trans4PASS+ (multi-scale)	See all
ScanNet	PTv3 + PPT	See all
PASCAL VOC 2012 val	EfficientNet-L2+NAS-FPN (single scale test, with self-training)	See all
DADA-seg	MMUDA	See all
Stanford2D3D Panoramic	SFSS-MMSI (RGB+HHA)	See all
ImageNet-S	TEC (ViT-B/16, 224x224, SSL+FT, mmseg)	See all
LaRS	SWIM^2 (Mask2Former)	See all
CamVid	SERNet-Former	See all
COCO-Stuff test	EVA	See all
iSAID	SegNeXt-L	See all
Semantic3D	Feature Geometric Net	See all
ISPRS Potsdam	AerialFormer-B	See all
Trans10K	Trans4Trans (M)	See all
Dark Zurich	Refign (HRDA)	See all
KITTI-360	CMNeXt (RGB-D-E-LiDAR)	See all
MCubeS	MMSFormer (RGB-A-D-N)	See all
DeLiVER	CMNeXt (RGB-D-E-LiDAR)	See all
UrbanLF	CMNeXt (RGB-LF80)	See all
LIP val	Hulk(Finetune, ViT-L)	See all
ScanNetV2	CMX	See all
GTAV-to-Cityscapes Labels	MIC	See all
Nighttime Driving	TADP	See all
LoveDA	ViT-G12X4	See all
EventScape	CMX (B4)	See all
FMB Dataset	MMSFormer (RGB-Infrared)	See all
ISPRS Vaihingen	LSKNet-S	See all
SpaceNet 1	MAE+MTP(ViT-L)	See all
ZJU-RGB-P	ShareCMP (B4 RGB-FP)	See all
INRIA Aerial Image Labeling	UANet(PVT-V2-B2)	See all
LLRGBD-synthetic	SMMCL (SegNeXt-B)	See all
UPLight	ShareCMP (B2 RGB-FP)	See all
MCubeS (P)	MMSFormer (RGB-A-D)	See all
SpectralWaste	CMX (RGB-HYPER)	See all
DDD17	CMNeXt	See all
DSEC	CMNeXt	See all
KITTI Semantic Segmentation	RPVNet [xu2021rpvnet]	See all
SkyScapes-Dense	SkyScapesNet-Dense	See all
FoodSeg103	FoodSAM	See all
SYNTHIA-to-Cityscapes	HRDA + PiPa	See all
SynPASS	Trans4PASS+	See all
SELMA	CMX	See all
Pothole Mix	Baseline - DeepLabv3+	See all
DELIVER	CMNeXt (RGB-D-E-LiDAR)	See all
VDD	Segformer-B2	See all
Mapillary val	AO-SegNet	See all
MS COCO	OneFormer (InternImage-H, emb_dim=1024, single-scale)	See all
Stanford2D3D - RGBD	CMX (SegFormer-B4)	See all
Event-based Segmentation Dataset	Bimodal SegNet	See all
GAMUS	TIMF	See all
ACDC Scribbles	ScribFormer	See all
ShapeNet	PatchFormer	See all
UAVid	LSKNet-S	See all
BIG	PSPNet + CascadePSP	See all
PETRAW	NCC Next	See all
Hypersim	MultiMAE (ViT-B)	See all
Structured3D	SFSS-MMSI (RGB+Depth+Normal)	See all
Matterport3D	SFSS-MMSI (RGB+Depth)	See all
CC3M-TagMask	TTD (TCL)	See all
PASCAL VOC 2011 test	Plugin network	See all
RELLIS-3D Dataset	GA-Nav	See all
PASTIS	Exchanger+Mask2Former	See all
SIFT-flow	RBE2E	See all
Stanford2D3D Panoramic - RGBD	CBFC	See all
Toronto-3D L002	SCF-Net	See all
Montgomery County X-ray Set	UNETR + SS-CXR	See all
dacl10k v1 testdev	FPN EfficientNet-B4 w/ Aux loss	See all
SYNTHIA-CVPR’16	SSMA	See all
Freiburg Forest	SSMA	See all
38-Cloud	Cloud-Net+	See all
PASCAL VOC 2007	GALDNet	See all
SkyScapes-Lane	SkyScapesNet-Lane	See all
Kvasir-Instrument	DoubleUNet	See all
Graz-02	VOLO-D5	See all
Cleargrasp (Novel)	Cleargrasp	See all
Cityscapes	SPFNet34M	See all
Endoscapes	MoCo V2 Surg SSL - DeepLabv3+ head	See all
HERA RFI Detection	Nearest Latent Neighbours	See all
LOFAR RFI Detection	Nearest Latent Neighbours	See all
BDD	FasterSeg	See all
COCO-Stuff	Deeplab v2	See all
Cam2BEV	uNetXST	See all
ApolloScape	ERFNet-IntRA-KD (ours)	See all
DroneDeploy	DLv3+ (Xception65)	See all
ManipalUAVid	UVid-Net	See all
Cityscapes VIPriors subset	EfficientSeg	See all
SBCoseg	Dice loss + IS-Triplet loss	See all
PASCAL VOC 2010 test	SIW	See all
PASCAL VOC 2012	DLDL-8s+CRF	See all
COCO-Stuff full	SegFormer-B5 (Single Scale)	See all
PASCAL VOC 2011	DLDL-8s+CRF	See all
AIRS	ICT-Net	See all
WildDash	SIW	See all
OpenEDS	RITnet	See all
SYNTHIA	CGA-Net	See all
PASCAL VOC	SegCLIP	See all
UTFPR-SBD3	EPYNET	See all
DIVA-HisDB	U-Net	See all
ATLANTIS	Erfani et al.	See all
PH2	MFSNet	See all
ISIC 2017	MFSNet	See all
HAM10000	MFSNet	See all
Mila Simulated Floods	FloodTransformer (Ours)	See all
SWIMSEG	ACLNet	See all
SWINSEG	ACLNet	See all
SWINySEG	ACLNet	See all
MixedWM38	WaferSegClassNet	See all
BDD100K val	NiseNet	See all
PASTIS-R	Late Fusion	See all
Cityscapes 3D	TaskPrompter	See all
FLAIR (French Land cover from Aerospace ImageRy)	U-Net baseline	See all
RUGD	GA-Nav	See all
dacl10k v1 testfinal	FPN EfficientNet-B4	See all
SemanticPOSS	TFNet	See all
COCO-Stuff-27	DiffSeg (512)	See all
Forward-Looking Sonar Marine Debris Datasets	Unet+RN34	See all
STARE	UNet	See all

Show all 126 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Semantic Segmentation models and implementations

PaddlePaddle/PaddleSeg

53 papers

8,347

rwightman/pytorch-image-models

33 papers

30,274

osmr/imgclsmob

30 papers

2,926

open-mmlab/mmsegmentation

19 papers

7,570

See all 40 libraries.

Datasets

Subtasks

Weakly-Supervised Semantic Segmentation

Scene Segmentation

Semi-Supervised Semantic Segmentation

Real-Time Semantic Segmentation

3D Part Segmentation

Unsupervised Semantic Segmentation

Road Segmentation

One-Shot Segmentation

Bird's-Eye View Semantic Segmentation

Crack Segmentation

UNET Segmentation

Class-Incremental Semantic Segmentation

Universal Segmentation

Polyp Segmentation

Vision-Language Segmentation

4D Spatio Temporal Semantic Segmentation

Histopathological Segmentation

Attentive segmentation networks

Text-Line Extraction

Aerial Video Semantic Segmentation

Amodal Panoptic Segmentation

Robust BEV Map Segmentation

Latest papers with no code

Most implemented Social Latest No code

A Point-Neighborhood Learning Framework for Nasal Endoscope Image Segmentation

no code yet • 30 May 2024

In this paper, we propose a weakly semi-supervised method called Point-Neighborhood Learning (PNL) framework.

Paper
Add Code

Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes

no code yet • 30 May 2024

Thanks to the application of deep learning technology in point cloud processing of the remote sensing field, point cloud segmentation has become a research hotspot in recent years, which can be applied to real-world 3D, smart cities, and other fields.

Paper
Add Code

CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation

no code yet • 30 May 2024

Accurate detection of colorectal cancer and early prevention heavily rely on precise polyp identification during gastrointestinal colonoscopy.

Paper
Add Code

Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models

no code yet • 29 May 2024

We design a simple baseline method, Reasoning3D, with the capability to understand and execute complex commands for (fine-grained) segmenting specific parts for 3D meshes with contextual awareness and reasoned answers for interactive segmentation.

Paper
Add Code

Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation

no code yet • 29 May 2024

Since the PEFT strategy is conducted symmetrically to the two CLIP modalities, the misalignment between them is mitigated.

Paper
Add Code

Optimizing Split Points for Error-Resilient SplitFed Learning

no code yet • 29 May 2024

Recent advancements in decentralized learning, such as Federated Learning (FL), Split Learning (SL), and Split Federated Learning (SplitFed), have expanded the potentials of machine learning.

Paper
Add Code

Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation

no code yet • 29 May 2024

During incrementally learning novel classes, the data distribution of old classes will be destroyed, leading to catastrophic forgetting.

Paper
Add Code

Lifelong Learning Using a Dynamically Growing Tree of Sub-networks for Domain Generalization in Video Object Segmentation

no code yet • 29 May 2024

However, when DGT is evaluated using in-domain multi-sources, the results show superior performance compared to state-of-the-art video object segmentation and other lifelong learning techniques with an average performance increase in the F-score of 6. 9% with minimal catastrophic forgetting.

Paper
Add Code

Enabling Visual Recognition at Radio Frequency

no code yet • 29 May 2024

This paper introduces PanoRadar, a novel RF imaging system that brings RF resolution close to that of LiDAR, while providing resilience against conditions challenging for optical signals.

Paper
Add Code

Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation

no code yet • 28 May 2024

In this paper, we propose shortcut mitigating augmentation (SMA) for WSSS, which generates synthetic representations of object-background combinations not seen in the training data to reduce the use of shortcut features.

Paper
Add Code

Semantic Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result