Scene Parsing

75 papers with code • 2 benchmarks • 4 datasets

Scene parsing is to segment and parse an image into different image regions associated with semantic categories, such as sky, road, person, and bed. MIT Description

Libraries

Use these libraries to find Scene Parsing models and implementations

HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion

LiJiahang617/HAPNet 4 Apr 2024

In this study, we take one step toward this new research area by exploring a feasible strategy to fully exploit VFM features for RGB-thermal scene parsing.

1
04 Apr 2024

Robust Shape Fitting for 3D Scene Abstraction

fkluger/cuboids_revisited 15 Mar 2024

A RANSAC estimator guided by a neural network fits these primitives to a depth map.

36
15 Mar 2024

Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing

chdyshli/unsupervised-road-parsing 5 Feb 2024

There are two challenges presented in parsing road scenes from UAV images: the complexity of processing high-resolution images and the dependency on extensive manual annotations required by traditional supervised deep learning methods to train robust and accurate models.

1
05 Feb 2024

A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing

KangchengLiu/RM3D 3 Dec 2023

More importantly, we innovatively propose to learn to merge the over-divided clusters based on the local low-level geometric property similarities and the learned high-level feature similarities supervised by weak labels.

26
03 Dec 2023

Generalized Label-Efficient 3D Scene Parsing via Hierarchical Feature Aligned Pre-Training and Region-Aware Fine-tuning

maudzung/SFA3D 1 Dec 2023

Deep neural network models have achieved remarkable progress in 3D scene understanding while trained in the closed-set setting and with full labels.

957
01 Dec 2023

Improving Panoptic Segmentation for Nighttime or Low-Illumination Urban Driving Scenes

ankur-chr/panoptic_segmentation_nighttime_urban_driving_scenes 23 Jun 2023

In this work, we propose two new methods, first to improve the performance, and second to improve the robustness of panoptic segmentation in nighttime or poor illumination urban driving scenes using a domain translation approach.

2
23 Jun 2023

RT-K-Net: Revisiting K-Net for Real-Time Panoptic Segmentation

markusschoen/rt-k-net 2 May 2023

Our resulting RT-K-Net sets a new state-of-the-art performance for real-time panoptic segmentation methods on the Cityscapes dataset and shows promising results on the challenging Mapillary Vistas dataset.

12
02 May 2023

DPF: Learning Dense Prediction Fields with Weak Supervision

cxx226/dpf CVPR 2023

We showcase the effectiveness of DPFs using two substantially different tasks: high-level semantic parsing and low-level intrinsic image decomposition.

40
29 Mar 2023

Traffic Scene Parsing through the TSP6K Dataset

pengtaojiang/tsp6k 6 Mar 2023

To date, most existing datasets focus on autonomous driving scenes.

10
06 Mar 2023

Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction

mlpc-ucsd/uni-3d ICCV 2023

Performing holistic 3D scene understanding from a single-view observation, involving generating instance shapes and 3D scene segmentation, is a long-standing challenge.

16
01 Jan 2023