Scene Classification

122 papers with code • 2 benchmarks • 21 datasets

Scene Classification is the task of assigning a photograph to a scene category. Unlike object classification, which focuses on prominent objects in the foreground, Scene Classification relies on the layout of objects within the scene, together with the ambient context.

Source: Scene classification with Convolutional Neural Networks

Benchmarks

These leaderboards track progress in Scene Classification.

Dataset	Best Model
UC Merced Land Use Dataset	µ2Net+ (ViT-L/16)
Places365-Standard	WaveMix

Datasets

Latest papers

AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning

jishengbai/audiolog • 21 Nov 2023

This paper presents AudioLog, a large language models (LLMs)-powered audio logging system with hybrid token-semantic contrastive learning.

CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision

aymanbegh/cd-coco • 12 Nov 2023

These new local distortions are generated by considering the scene context of the images that guarantees a high level of photo-realism.

Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification

yuanbo2020/ergl • 5 Oct 2023

The results show the feasibility of recognizing diverse acoustic scenes based on the audio event-relational graph.

Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification

eihw/asc_sharpness • 28 Sep 2023

The correlation between the sharpness of loss minima and generalisation in the context of deep neural networks has been subject to discussion for a long time.

DeCUR: decoupling common & unique representations for multimodal self-supervision

zhu-xlab/dino-mm • 11 Sep 2023

We propose Decoupling Common and Unique Representations (DeCUR), a simple yet effective method for multimodal self-supervised learning.

SOAR: Scene-debiasing Open-set Action Recognition

yhZhai/SOAR • ICCV 2023

The former prevents the decoder from reconstructing the video background given video features, and thus helps reduce the background information in feature learning.

03 Sep 2023

Efficient Multi-Task Scene Analysis with RGB-D Transformers

tui-nicr/nicr-scene-analysis-datasets • 8 Jun 2023

However, we show that the dual CNN-based encoder of EMSANet can be replaced with a single Transformer-based encoder.

Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes

masuqiang/mcfa-pytorch • 31 May 2023

To address the zero-shot image scene classification, the cross-modal feature alignment methods have been proposed in recent years.

Device-Robust Acoustic Scene Classification via Impulse Response Augmentation

themoro/diraugmentation • 12 May 2023

However, we also show that DIR augmentation and Freq-MixStyle are complementary, achieving a new state-of-the-art performance on signals recorded by devices unseen during training.

Vision-Language Models in Remote Sensing: Current Progress and Future Trends

lzw-lzw/awesome-remote-sensing-vision-language-models • 9 May 2023

Existing AI-related research in remote sensing primarily focuses on visual understanding tasks while neglecting the semantic understanding of the objects and their relationships.
