Scene Segmentation

121 papers with code • 5 benchmarks • 7 datasets

Scene segmentation is the task of splitting a scene into its various object components.

Image adapted from Temporally coherent 4D reconstruction of complex dynamic scenes.

Libraries

Use these libraries to find Scene Segmentation models and implementations
3 papers
2,924
3 papers
1,696
See all 6 libraries.

VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions

patrick-tssn/VSTAR 30 May 2023

Video-grounded dialogue understanding is a challenging problem that requires machine to perceive, parse and reason over situated semantics extracted from weakly aligned video and dialogues.

11
30 May 2023

SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery

lalithjets/surgicalgpt 19 Apr 2023

Given the limitations of unidirectional attention in GPT models and their ability to generate coherent long paragraphs, we carefully sequence the word tokens before vision tokens, mimicking the human thought process of understanding the question to infer an answer from an image.

17
19 Apr 2023

FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding

uark-cviu/fredom CVPR 2023

Although Domain Adaptation in Semantic Scene Segmentation has shown impressive improvement in recent years, the fairness concerns in the domain adaptation have yet to be well defined and addressed.

10
04 Apr 2023

Self-positioning Point-based Transformer for Point Cloud Understanding

mlvlab/spotr CVPR 2023

In this paper, we present a Self-Positioning point-based Transformer (SPoTr), which is designed to capture both local and global shape contexts with reduced complexity.

80
29 Mar 2023

Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies

tencentyouturesearch/highlightdetection-clc CVPR 2023

Based on existing efforts, this work has two observations: (1) For different annotators, labeling highlight has uncertainty, which leads to inaccurate and time-consuming annotations.

17
26 Mar 2023

Neural Implicit Vision-Language Feature Fields

ethz-asl/autolabel 20 Mar 2023

In this work, we present a zero-shot volumetric open-vocabulary semantic scene segmentation method.

42
20 Mar 2023

Semantic segmentation of surgical hyperspectral images under geometric domain shifts

imsy-dkfz/htc 20 Mar 2023

According to a comprehensive validation on six different OOD data sets comprising 600 RGB and hyperspectral imaging (HSI) cubes from 33 pigs semantically annotated with 19 classes, we demonstrate a large performance drop of SOA organ segmentation networks applied to geometric OOD data.

24
20 Mar 2023

Towards Surgical Context Inference and Translation to Gestures

uva-dsa/auto_surgical_context2gesture 28 Feb 2023

We evaluate the performance of each stage of our method by comparing the results with the ground truth segmentation masks, the consensus context labels, and the gesture labels in the JIGSAWS dataset.

0
28 Feb 2023

Paced-Curriculum Distillation with Prediction and Label Uncertainty for Image Segmentation

mobarakol/p-cd 2 Feb 2023

Purpose: In curriculum learning, the idea is to train on easier samples first and gradually increase the difficulty, while in self-paced learning, a pacing function defines the speed to adapt the training progress.

10
02 Feb 2023

Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction

mlpc-ucsd/uni-3d ICCV 2023

Performing holistic 3D scene understanding from a single-view observation, involving generating instance shapes and 3D scene segmentation, is a long-standing challenge.

17
01 Jan 2023