Scene Segmentation

122 papers with code • 5 benchmarks • 7 datasets

Scene segmentation is the task of splitting a scene into its various object components.

Image adapted from Temporally coherent 4D reconstruction of complex dynamic scenes.

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Segmentation

Dataset	Best Model	Compare
SUN-RGBD	ICM	See all
ScanNet	3DMV	See all
StreetHazards	Mask2Anomaly	See all
NYU Depth v2	Dilated FCN-2s RGB	See all
UAVid	UNetFormer	See all

Libraries

Use these libraries to find Scene Segmentation models and implementations

PaddlePaddle/PaddleSeg

3 papers

8,357

osmr/imgclsmob

3 papers

2,926

isl-org/Open3D-ML

3 papers

1,714

open-mmlab/mmsegmentation

2 papers

7,583

See all 6 libraries.

Datasets

Subtasks

Thermal Image Segmentation

Latest papers

Most implemented Social Latest No code

Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation

BiQiWHU/CMFormer • • 1 Jul 2023

Unlike domain gap challenges, USSS is unique in that the semantic categories are often similar in different urban scenes, while the styles can vary significantly due to changes in urban landscapes, weather conditions, lighting, and other factors.

01 Jul 2023

Paper
Code

VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions

patrick-tssn/VSTAR • • 30 May 2023

Video-grounded dialogue understanding is a challenging problem that requires machine to perceive, parse and reason over situated semantics extracted from weakly aligned video and dialogues.

30 May 2023

Paper
Code

SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery

lalithjets/surgicalgpt • • 19 Apr 2023

Given the limitations of unidirectional attention in GPT models and their ability to generate coherent long paragraphs, we carefully sequence the word tokens before vision tokens, mimicking the human thought process of understanding the question to infer an answer from an image.

19 Apr 2023

Paper
Code

FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding

uark-cviu/fredom • • CVPR 2023

Although Domain Adaptation in Semantic Scene Segmentation has shown impressive improvement in recent years, the fairness concerns in the domain adaptation have yet to be well defined and addressed.

04 Apr 2023

Paper
Code

Self-positioning Point-based Transformer for Point Cloud Understanding

mlvlab/spotr • • CVPR 2023

In this paper, we present a Self-Positioning point-based Transformer (SPoTr), which is designed to capture both local and global shape contexts with reduced complexity.

29 Mar 2023

Paper
Code

Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies

tencentyouturesearch/highlightdetection-clc • CVPR 2023

Based on existing efforts, this work has two observations: (1) For different annotators, labeling highlight has uncertainty, which leads to inaccurate and time-consuming annotations.

26 Mar 2023

Paper
Code

Neural Implicit Vision-Language Feature Fields

ethz-asl/autolabel • • 20 Mar 2023

In this work, we present a zero-shot volumetric open-vocabulary semantic scene segmentation method.

20 Mar 2023

Paper
Code

Semantic segmentation of surgical hyperspectral images under geometric domain shifts

imsy-dkfz/htc • • 20 Mar 2023

According to a comprehensive validation on six different OOD data sets comprising 600 RGB and hyperspectral imaging (HSI) cubes from 33 pigs semantically annotated with 19 classes, we demonstrate a large performance drop of SOA organ segmentation networks applied to geometric OOD data.

20 Mar 2023

Paper
Code

Towards Surgical Context Inference and Translation to Gestures

uva-dsa/auto_surgical_context2gesture • • 28 Feb 2023

We evaluate the performance of each stage of our method by comparing the results with the ground truth segmentation masks, the consensus context labels, and the gesture labels in the JIGSAWS dataset.

28 Feb 2023

Paper
Code

Paced-Curriculum Distillation with Prediction and Label Uncertainty for Image Segmentation

mobarakol/p-cd • • 2 Feb 2023

Purpose: In curriculum learning, the idea is to train on easier samples first and gradually increase the difficulty, while in self-paced learning, a pacing function defines the speed to adapt the training progress.

02 Feb 2023

Paper
Code

Scene Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result