Scene Recognition

64 papers with code • 8 benchmarks • 15 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Recognition

Dataset	Best Model	Compare
YUP++	DEEP-HAL with ODF+SDF (I3D)	See all
MIT Indoor Scenes	FOSNet	See all
AID	AGOS	See all
SUN-RGBD	OMNIVORE (Swin-B)	See all
Places365	FOSNet	See all
SUN397	FOSNet	See all
ScanNet	SSMA	See all
ADE20K	Semantic-Aware Scene Recogniton (ResNet-18)	See all

Datasets

Most implemented papers

Most implemented Social Latest No code

Omnivore: A Single Model for Many Visual Modalities

facebookresearch/omnivore • • CVPR 2022

Prior work has studied different visual modalities in isolation and developed separate architectures for recognition of images, videos, and 3D data.

Paper
Code

An Empirical Study of Remote Sensing Pretraining

vitae-transformer/vitae-transformer-remote-sensing • 6 Apr 2022

To this end, we train different networks from scratch with the help of the largest RS scene recognition dataset up to now -- MillionAID, to obtain a series of RS pretrained backbones, including both convolutional neural networks (CNN) and vision transformers such as Swin and ViTAE, which have shown promising performance on computer vision tasks.

Paper
Code

Object Detectors Emerge in Deep Scene CNNs

JepsonWong/CNN_Visualization • 22 Dec 2014

With the success of new computational architectures for visual processing, such as convolutional neural networks (CNN) and access to image databases with millions of labeled examples (e. g., ImageNet, Places), the state of the art in computer vision is advancing rapidly.

Paper
Code

Learning image representations tied to ego-motion

tu-rbo/concarne • ICCV 2015

Understanding how images of objects and scenes behave in response to specific ego-motions is a crucial aspect of proper visual development, yet existing visual learning methods are conspicuously disconnected from the physical source of their images.

Paper
Code

Deep Filter Banks for Texture Recognition and Segmentation

mcimpoi/deep-fbanks • CVPR 2015

Research in texture recognition often concentrates on the problem of material recognition in uncluttered conditions, an assumption rarely met by applications.

Paper
Code

Classifying Variable-Length Audio Files with All-Convolutional Networks and Masked Global Pooling

numpde/phonepad • • 11 Jul 2016

We trained a deep all-convolutional neural network with masked global pooling to perform single-label classification for acoustic scene classification and multi-label classification for domestic audio tagging in the DCASE-2016 contest.

Paper
Code

Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition

wangzheallen/vsad • 1 Sep 2016

In this paper, we propose a hybrid representation, which leverages the discriminative capacity of CNNs and the simplicity of descriptor encoding schema for image recognition, with a focus on scene recognition.

Paper
Code