Scene Understanding

511 papers with code • 3 benchmarks • 43 datasets

Scene Understanding is something that to understand a scene. For instance, iPhone has function that help eye disabled person to take a photo by discribing what the camera sees. This is an example of Scene Understanding.

Libraries

Use these libraries to find Scene Understanding models and implementations
4 papers
2,917
4 papers
1,108
See all 5 libraries.

Most implemented papers

Boundary-Seeking Generative Adversarial Networks

eriklindernoren/PyTorch-GAN 27 Feb 2017

We introduce a method for training GANs with discrete data that uses the estimated difference measure from the discriminator to compute importance weights for generated samples, thus providing a policy gradient for training the generator.

Multi-Task Learning as Multi-Objective Optimization

IntelVCL/MultiObjectiveOptimization NeurIPS 2018

These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks.

ShelfNet for Fast Semantic Segmentation

juntang-zhuang/ShelfNet 27 Nov 2018

Compared with real-time segmentation models such as BiSeNet, our model achieves higher accuracy at comparable speed on the Cityscapes Dataset, enabling the application in speed-demanding tasks such as street-scene understanding for autonomous driving.

From Points to Parts: 3D Object Detection from Point Cloud with Part-aware and Part-aggregation Network

sshaoshuai/PointCloudDet3D 8 Jul 2019

3D object detection from LiDAR point cloud is a challenging problem in 3D scene understanding and has many practical applications.

Adapting Deep Network Features to Capture Psychological Representations

kbraunlich/contort_DNN 6 Aug 2016

To remedy this, we develop a method for adapting deep features to align with human similarity judgments, resulting in image representations that can potentially be used to extend the scope of psychological experiments.

SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences

PRBonn/semantic-kitti-api ICCV 2019

Despite the relevance of semantic scene understanding for this application, there is a lack of a large dataset for this task which is based on an automotive LiDAR.

Towards Ghost-free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN

vinthony/ghost-free-shadow-removal 20 Nov 2019

With the help of novel masks or scenes, we enhance the current datasets using synthesized shadow images.

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

xiaofengShi/CHINESE-OCR 26 Jan 2016

The goal of COCO-Text is to advance state-of-the-art in text detection and recognition in natural images.

Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions

facebookresearch/TensorComprehensions 13 Feb 2018

Deep learning models with convolutional and recurrent networks are now ubiquitous and analyze massive amounts of audio, image, video, text and graph data, with applications in automatic translation, speech-to-text, scene understanding, ranking user preferences, ad placement, etc.