Scene Classification

122 papers with code • 2 benchmarks • 21 datasets

Scene Classification is a task in which scenes from photographs are categorically classified. Unlike object classification, which focuses on classifying prominent objects in the foreground, Scene Classification uses the layout of objects within the scene, in addition to the ambient context, for classification.

Source: Scene classification with Convolutional Neural Networks

AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning

jishengbai/audiolog 21 Nov 2023

This paper presents AudioLog, a large language models (LLMs)-powered audio logging system with hybrid token-semantic contrastive learning.

2
21 Nov 2023

CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision

aymanbegh/cd-coco 12 Nov 2023

These new local distortions are generated by considering the scene context of the images that guarantees a high level of photo-realism.

8
12 Nov 2023

Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification

yuanbo2020/ergl 5 Oct 2023

The results show the feasibility of recognizing diverse acoustic scenes based on the audio event-relational graph.

8
05 Oct 2023

Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification

eihw/asc_sharpness 28 Sep 2023

The correlation between the sharpness of loss minima and generalisation in the context of deep neural networks has been subject to discussion for a long time.

1
28 Sep 2023

DeCUR: decoupling common & unique representations for multimodal self-supervision

zhu-xlab/dino-mm 11 Sep 2023

We propose Decoupling Common and Unique Representations (DeCUR), a simple yet effective method for multimodal self-supervised learning.

21
11 Sep 2023

SOAR: Scene-debiasing Open-set Action Recognition

yhZhai/SOAR ICCV 2023

The former prevents the decoder from reconstructing the video background given video features, and thus helps reduce the background information in feature learning.

7
03 Sep 2023

Efficient Multi-Task Scene Analysis with RGB-D Transformers

tui-nicr/nicr-scene-analysis-datasets 8 Jun 2023

However, we show that the dual CNN-based encoder of EMSANet can be replaced with a single Transformer-based encoder.

13
08 Jun 2023

Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes

masuqiang/mcfa-pytorch 31 May 2023

To address the zero-shot image scene classification, the cross-modal feature alignment methods have been proposed in recent years.

6
31 May 2023

Device-Robust Acoustic Scene Classification via Impulse Response Augmentation

themoro/diraugmentation 12 May 2023

However, we also show that DIR augmentation and Freq-MixStyle are complementary, achieving a new state-of-the-art performance on signals recorded by devices unseen during training.

10
12 May 2023

Vision-Language Models in Remote Sensing: Current Progress and Future Trends

lzw-lzw/awesome-remote-sensing-vision-language-models 9 May 2023

Existing AI-related research in remote sensing primarily focuses on visual understanding tasks while neglecting the semantic understanding of the objects and their relationships.

84
09 May 2023