Scene Parsing

75 papers with code • 2 benchmarks • 4 datasets

Scene parsing is to segment and parse an image into different image regions associated with semantic categories, such as sky, road, person, and bed. MIT Description

Libraries

Use these libraries to find Scene Parsing models and implementations

Latest papers with no code

Treasure What You Have: Exploiting Similarity in Deep Neural Networks for Efficient Video Processing

no code yet • 10 May 2023

Deep learning has enabled various Internet of Things (IoT) applications.

Local and Global Contextual Features Fusion for Pedestrian Intention Prediction

no code yet • 1 May 2023

The pedestrian features include body pose and local context features that represent the pedestrian's behaviour.

Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving

no code yet • CVPR 2023

To this end, we propose a two-stage weakly supervised approach, where the segmentation model trained with the incomplete binary masks in Stage1 will facilitate the self-supervised learning of the motion prediction network in Stage2 by estimating possible moving foregrounds in advance.

Re:PolyWorld - A Graph Neural Network for Polygonal Scene Parsing

no code yet • ICCV 2023

Re:PolyWorld not only outperforms the original model on building extraction in aerial images, thanks to the proposed joint analysis of vertices and edges, but also beats the state-of-the-art in multiple other domains.

Visual Traffic Knowledge Graph Generation from Scene Images

no code yet • ICCV 2023

Although previous works on traffic scene understanding have achieved great success, most of them stop at a lowlevel perception stage, such as road segmentation and lane detection, and few concern high-level understanding.

Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection

no code yet • 10 Dec 2022

Most multi-modal 3D object detection frameworks integrate semantic knowledge from 2D images into 3D LiDAR point clouds to enhance detection accuracy.

GEBNet: Graph-Enhancement Branch Network for RGB-T Scene Parsing

no code yet • journal 2022

RGB-T (red–green–blue and thermal) scene parsing has recently drawn considerable research attention.

Boundary Corrected Multi-scale Fusion Network for Real-time Semantic Segmentation

no code yet • 1 Mar 2022

Image semantic segmentation aims at the pixel-level classification of images, which has requirements for both accuracy and speed in practical application.

Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling

no code yet • 6 Jan 2022

Finally, we perform ASP by unifying the tile-level scene classification and object-based image analysis to achieve pixel-wise semantic labeling.

ESCNet: Gaze Target Detection With the Understanding of 3D Scenes

no code yet • CVPR 2022

This paper aims to address the single image gaze target detection problem.