Acoustic Scene Classification

37 papers with code • 5 benchmarks • 10 datasets

The goal of acoustic scene classification is to classify a test recording into one of the provided predefined classes that characterizes the environment in which it was recorded.

Source: DCASE 2019 Source: DCASE 2018

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

jishengbai/icme2024asc 5 Feb 2024

In addition, considering the abundance of unlabeled acoustic scene data in the real world, it is important to study the possible ways to utilize these unlabelled data.

16
05 Feb 2024

AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning

jishengbai/audiolog 21 Nov 2023

This paper presents AudioLog, a large language models (LLMs)-powered audio logging system with hybrid token-semantic contrastive learning.

2
21 Nov 2023

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

alibaba-damo-academy/FunASR 14 Nov 2023

Recently, instruction-following audio-language models have received broad attention for audio interaction with humans.

3,115
14 Nov 2023

Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification

yuanbo2020/ergl 5 Oct 2023

The results show the feasibility of recognizing diverse acoustic scenes based on the audio event-relational graph.

8
05 Oct 2023

Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification

eihw/asc_sharpness 28 Sep 2023

The correlation between the sharpness of loss minima and generalisation in the context of deep neural networks has been subject to discussion for a long time.

1
28 Sep 2023

Device-Robust Acoustic Scene Classification via Impulse Response Augmentation

themoro/diraugmentation 12 May 2023

However, we also show that DIR augmentation and Freq-MixStyle are complementary, achieving a new state-of-the-art performance on signals recorded by devices unseen during training.

9
12 May 2023

Unsupervised Improvement of Audio-Text Cross-Modal Representations

zhepeiw/clap_curation 3 May 2023

In this paper, we study unsupervised approaches to improve the learning framework of such representations with unpaired text and audio.

4
03 May 2023

CochlScene: Acquisition of acoustic scene data using crowdsourcing

cochlearai/cochlscene 4 Nov 2022

This paper describes a pipeline for collecting acoustic scene data by using crowdsourcing.

3
04 Nov 2022

Multi-dimensional Edge-based Audio Event Relational Graph Representation Learning for Acoustic Scene Classification

yuanbo2020/ergl 27 Oct 2022

Experiments on a polyphonic acoustic scene dataset show that the proposed ERGL achieves competitive performance on ASC by using only a limited number of embeddings of audio events without any data augmentations.

8
27 Oct 2022

Efficient Similarity-based Passive Filter Pruning for Compressing CNNs

arshdeep-singh-boparai/efficient_similarity_pruning_algo 27 Oct 2022

However, the computational complexity of computing the pairwise similarity matrix is high, particularly when a convolutional layer has many filters.

3
27 Oct 2022