Semantic Segmentation

5164 papers with code • 125 benchmarks • 313 datasets

Semantic Segmentation is a computer vision task in which the goal is to categorize each pixel in an image into a class or object. The goal is to produce a dense pixel-wise segmentation map of an image, where each pixel is assigned to a specific class or object. Some example benchmarks for this task are Cityscapes, PASCAL VOC and ADE20K. Models are usually evaluated with the Mean Intersection-Over-Union (Mean IoU) and Pixel Accuracy metrics.

( Image credit: CSAILVision )

Libraries

Use these libraries to find Semantic Segmentation models and implementations
53 papers
8,226
30 papers
2,917
See all 39 libraries.

ViM-UNet: Vision Mamba for Biomedical Segmentation

faceonlive/ai-research 11 Apr 2024

Here, we introduce ViM-UNet, a novel segmentation architecture based on it and compare it to UNet and UNETR for two challenging microscopy instance segmentation tasks.

124
11 Apr 2024

OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities

faceonlive/ai-research 11 Apr 2024

We present OpenTrench3D, a novel and comprehensive 3D Semantic Segmentation point cloud dataset, designed to advance research and development in underground utility surveying and mapping.

124
11 Apr 2024

Multi-view Aggregation Network for Dichotomous Image Segmentation

faceonlive/ai-research 11 Apr 2024

Dichotomous Image Segmentation (DIS) has recently emerged towards high-precision object segmentation from high-resolution natural images.

124
11 Apr 2024

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

faceonlive/ai-research 8 Apr 2024

Classifier-Free Guidance (CFG) has been widely used in text-to-image diffusion models, where the CFG scale is introduced to control the strength of text guidance on the whole image space.

124
08 Apr 2024

LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image Segmentation

faceonlive/ai-research 7 Apr 2024

As a result of the rise of Transformer architectures in medical image analysis, specifically in the domain of medical image segmentation, a multitude of hybrid models have been created that merge the advantages of Convolutional Neural Networks (CNNs) and Transformers.

124
07 Apr 2024

FPL+: Filtered Pseudo Label-based Unsupervised Cross-Modality Adaptation for 3D Medical Image Segmentation

hilab-git/fpl-plus 7 Apr 2024

Adapting a medical image segmentation model to a new domain is important for improving its cross-domain transferability, and due to the expensive annotation process, Unsupervised Domain Adaptation (UDA) is appealing where only unlabeled images are needed for the adaptation.

4
07 Apr 2024

Frequency Decomposition-Driven Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation

faceonlive/ai-research 6 Apr 2024

Cross-domain semantic segmentation of remote sensing (RS) imagery based on unsupervised domain adaptation (UDA) techniques has significantly advanced deep-learning applications in the geosciences.

124
06 Apr 2024

Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

faceonlive/ai-research 5 Apr 2024

In this work, we introduce Sigma, a Siamese Mamba network for multi-modal semantic segmentation, utilizing the Selective Structured State Space Model, Mamba.

124
05 Apr 2024

PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model

amrinkareem/paris3d 4 Apr 2024

We introduce a novel segmentation task known as reasoning part segmentation for 3D objects, aiming to output a segmentation mask based on complex and implicit textual queries about specific parts of a 3D object.

2
04 Apr 2024

HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion

LiJiahang617/HAPNet 4 Apr 2024

In this study, we take one step toward this new research area by exploring a feasible strategy to fully exploit VFM features for RGB-thermal scene parsing.

1
04 Apr 2024