Scene Parsing

75 papers with code • 2 benchmarks • 4 datasets

Scene parsing segments an image into regions, each associated with a semantic category such as sky, road, person, or bed. Source: MIT Description
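As a rough illustration of the definition above, the sketch below assigns every pixel the category with the highest score and then counts the pixels in each resulting category. The category list, the `parse_scene` and `region_sizes` helpers, and the toy scores are all hypothetical; a real scene parser would obtain the per-pixel scores from a trained model.

```python
# Hypothetical category names, taken from the examples in the description.
CATEGORIES = ["sky", "road", "person", "bed"]


def parse_scene(scores):
    """Assign each pixel the category with the highest score.

    scores[y][x] is a list of per-category scores for the pixel at
    row y, column x, in the same order as CATEGORIES.
    """
    label_map = []
    for row in scores:
        label_row = []
        for pixel_scores in row:
            # Index of the largest score for this pixel.
            best = max(range(len(pixel_scores)), key=pixel_scores.__getitem__)
            label_row.append(CATEGORIES[best])
        label_map.append(label_row)
    return label_map


def region_sizes(label_map):
    """Count how many pixels were assigned to each category."""
    counts = {}
    for row in label_map:
        for label in row:
            counts[label] = counts.get(label, 0) + 1
    return counts


# A toy 2x3 "image" with made-up scores (assumption, not model output).
toy_scores = [
    [[0.9, 0.05, 0.03, 0.02], [0.8, 0.1, 0.05, 0.05], [0.7, 0.2, 0.05, 0.05]],
    [[0.1, 0.8, 0.05, 0.05], [0.1, 0.3, 0.55, 0.05], [0.05, 0.9, 0.03, 0.02]],
]

labels = parse_scene(toy_scores)
for row in labels:
    print(row)
print(region_sizes(labels))
```

The per-pixel argmax is only the final labeling step; the methods listed further down differ mainly in how they produce the scores.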

Benchmarks

These leaderboards track progress in Scene Parsing.

Dataset	Best Model
PGDP5K	PGDPNet
Cityscapes test	VCD No Coarse

Libraries

Use these libraries to find Scene Parsing models and implementations.

PaddlePaddle/PaddleSeg • 4 papers • 8,226

open-mmlab/mmsegmentation • 3 papers • 7,370

CSAILVision/semantic-segmentation-p… • 2 papers • 4,834

sithu31296/semantic-segmentation • 2 papers • 751

See all 5 libraries.

Datasets

Subtasks

Scene Recognition

Face Parsing

Indoor Scene Synthesis

Indoor Scene Reconstruction

Scene Labeling

Street Scene Parsing

Most implemented papers

Context-Aware Synthesis and Placement of Object Instances

NVlabs/Instance_Insertion • NeurIPS 2018

Learning to insert an object instance into an image in a semantically coherent manner is a challenging and interesting problem.

GFF: Gated Fully Fusion for Semantic Segmentation

lxtGH/DecoupleSegNets • 3 Apr 2019

Semantic segmentation generates comprehensive understanding of scenes through densely predicting the category for each pixel.

SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications

PengyiZhang/SlimYOLOv3 • 25 Jul 2019

Drones or general Unmanned Aerial Vehicles (UAVs), endowed with computer vision function by on-board cameras and embedded systems, have become popular in a wide range of applications.

Dynamic Multi-Scale Filters for Semantic Segmentation

PaddlePaddle/PaddleSeg • ICCV 2019

DMNet is composed of multiple Dynamic Convolutional Modules (DCMs) arranged in parallel, each of which exploits context-aware filters to estimate semantic representation for a specific scale.

Strip Pooling: Rethinking Spatial Pooling for Scene Parsing

Andrew-Qibin/SPNet • CVPR 2020

Spatial pooling has been proven highly effective in capturing long-range contextual information for pixel-wise prediction tasks, such as scene parsing.

CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement

hkchengrex/CascadePSP • CVPR 2020

In this paper, we propose a novel approach to address the high-resolution segmentation problem without using any high-resolution training data.

Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing

charlesCXK/RGBD_Semantic_Segmentation_PyTorch • ECCV 2020

In this paper, we propose a novel operator called malleable 2.5D convolution to learn the receptive field along the depth-axis.

Minimal Solvers for Single-View Lens-Distorted Camera Auto-Calibration

ylochman/single-view-autocalib • 17 Nov 2020

This paper proposes minimal solvers that use combinations of imaged translational symmetries and parallel scene lines to jointly estimate lens undistortion with either affine rectification or focal length and absolute orientation.

Kimera: from SLAM to Spatial Perception with 3D Dynamic Scene Graphs

MIT-SPARK/Kimera • 18 Jan 2021

This mental model captures geometric and semantic aspects of the scene, describes the environment at multiple levels of abstraction (e.g., objects, rooms, buildings), and includes static and dynamic entities and their relations (e.g., a person is in a room at a given time).

Mesh Convolution with Continuous Filters for 3D Surface Parsing

enyahermite/picasso • 3 Dec 2021

In this paper, we propose a series of modular operations for effective geometric feature learning from 3D triangle meshes.
