Scene Parsing

75 papers with code • 2 benchmarks • 4 datasets

Scene parsing is to segment and parse an image into different image regions associated with semantic categories, such as sky, road, person, and bed. MIT Description

Libraries

Use these libraries to find Scene Parsing models and implementations

OneFormer: One Transformer to Rule Universal Image Segmentation

huggingface/transformers CVPR 2023

However, such panoptic architectures do not truly unify image segmentation because they need to be trained individually on the semantic, instance, or panoptic segmentation to achieve the best performance.

124,593
10 Nov 2022

VIBUS: Data-efficient 3D Scene Parsing with VIewpoint Bottleneck and Uncertainty-Spectrum Modeling

air-discover/vibus 20 Oct 2022

In the first stage, we perform self-supervised representation learning on unlabeled points with the proposed Viewpoint Bottleneck loss function.

156
20 Oct 2022

Boosting Night-time Scene Parsing with Learnable Frequency

wangsen99/FDLNet 30 Aug 2022

Based on this, we propose to exploit the image frequency distributions for night-time scene parsing.

10
30 Aug 2022

A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing

apple/ml-dms-dataset 21 Jul 2022

A key algorithm for understanding the world is material segmentation, which assigns a label (metal, glass, etc.)

45
21 Jul 2022

Plane Geometry Diagram Parsing

mingliangzhang2018/PGDP 19 May 2022

Geometry diagram parsing plays a key role in geometry problem solving, wherein the primitive extraction and relation parsing remain challenging due to the complex layout and between-primitive relationship.

32
19 May 2022

FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing

floatseg/floatseg.github.io CVPR 2022

Our framework involves independent dense prediction of object category and part attributes which increases scalability and reduces task complexity compared to the monolithic label space counterpart.

0
30 Mar 2022

TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes

GAP-LAB-CUHK-SZ/TO-Scene 17 Mar 2022

Experiments show that the algorithms trained on TO-Scene indeed work on the realistic test data, and our proposed tabletop-aware learning strategy greatly improves the state-of-the-art results on both 3D semantic segmentation and object detection tasks.

41
17 Mar 2022

Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing

shaohuadong2021/egfnet 9 Dec 2021

Considering the importance of high level semantic information, we propose a global information module and a semantic information module to extract rich semantic information from the high-level features.

24
09 Dec 2021

Mesh Convolution with Continuous Filters for 3D Surface Parsing

enyahermite/picasso 3 Dec 2021

In this paper, we propose a series of modular operations for effective geometric feature learning from 3D triangle meshes.

121
03 Dec 2021

Pointly-supervised 3D Scene Parsing with Viewpoint Bottleneck

open-air-sun/viewpoint-bottleneck 17 Sep 2021

Semantic understanding of 3D point clouds is important for various robotics applications.

62
17 Sep 2021