Sound Event Detection

74 papers with code • 4 benchmarks • 18 datasets

Sound Event Detection (SED) is the task of recognizing sound events and their respective start and end times in a recording. Sound events in real life do not always occur in isolation, but tend to overlap considerably with each other. Recognizing such overlapping sound events is referred to as polyphonic SED.

Source: A report on sound event detection with different binaural features
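
To make the task definition concrete, here is a minimal sketch of the kind of output an SED system produces. It is not taken from the cited report or from any paper listed below. It assumes a hypothetical model that emits one probability per class per frame, and the function name `frames_to_events`, the `hop_seconds` frame spacing, the 0.5 threshold, and the class labels are all illustrative choices. Each class is thresholded independently, which is what lets overlapping (polyphonic) events be reported at the same time.

```python
from typing import List, Sequence, Tuple

# (label, onset_seconds, offset_seconds); a hypothetical event representation.
Event = Tuple[str, float, float]


def frames_to_events(
    probs: Sequence[Sequence[float]],
    labels: Sequence[str],
    hop_seconds: float = 0.02,
    threshold: float = 0.5,
) -> List[Event]:
    """Convert per-frame class probabilities into labeled time intervals.

    probs[t][c] is the assumed probability that class c is active in frame t.
    Classes are handled independently, so events may overlap in time.
    """
    events: List[Event] = []
    for class_idx, label in enumerate(labels):
        onset_frame = None  # frame index where the current event started
        for frame_idx, frame in enumerate(probs):
            active = frame[class_idx] >= threshold
            if active and onset_frame is None:
                onset_frame = frame_idx
            elif not active and onset_frame is not None:
                events.append(
                    (label, onset_frame * hop_seconds, frame_idx * hop_seconds)
                )
                onset_frame = None
        # Close an event that is still active at the end of the recording.
        if onset_frame is not None:
            events.append(
                (label, onset_frame * hop_seconds, len(probs) * hop_seconds)
            )
    # Sort by onset time, then by label, for a stable listing.
    return sorted(events, key=lambda e: (e[1], e[0]))


# Made-up probabilities for six frames spaced 0.5 s apart.
example_labels = ["speech", "dog_bark"]
example_probs = [
    [0.9, 0.1],
    [0.8, 0.7],
    [0.9, 0.8],
    [0.2, 0.9],
    [0.1, 0.2],
    [0.7, 0.1],
]
example_events = frames_to_events(example_probs, example_labels, hop_seconds=0.5)
for label, onset, offset in example_events:
    print(f"{label}: {onset:.1f} s to {offset:.1f} s")
```

In this toy input, the speech event (0.0 s to 1.5 s) and the dog bark (0.5 s to 2.0 s) overlap for one second, which is the polyphonic case described above. Practical systems usually add smoothing such as median filtering before thresholding; that step is omitted here for brevity.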

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

ttgeng233/UniAV 4 Apr 2024

Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL).

3

Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection

c4dm/dcase-few-shot-bioacoustic 27 Mar 2024

A recent development in the field is the introduction of the task known as few-shot bioacoustic sound event detection, which aims to train a versatile animal sound detector using only a small set of audio samples.

44

Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection

Harper812/FFDConv 10 Jan 2024

Recently, 2D convolution has been found to be ill-suited to sound event detection (SED).

6

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

dberghi/av-seld 14 Dec 2023

Sound event localization and detection (SELD) combines two subtasks: sound event detection (SED) and direction of arrival (DOA) estimation.

8

w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training

orlllem/seld_wav2vec2 12 Dec 2023

By applying this approach to SELD, we can leverage a substantial amount of unlabeled 3D audio data to learn robust representations of sound events and their locations.

5

AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning

jishengbai/audiolog 21 Nov 2023

This paper presents AudioLog, a large language model (LLM)-powered audio logging system with hybrid token-semantic contrastive learning.

2

Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems

ronfrancesca/sed-carbon-footprint 5 Oct 2023

In recent years, deep learning systems have shown a concerning trend toward increased complexity and higher energy consumption.

5

Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection

ilyassmoummad/rcl_fs_bsed 16 Sep 2023

Bioacoustic sound event detection allows for a better understanding of animal behavior and for better monitoring of biodiversity using audio.

3

Fine-tune the pretrained ATST model for sound event detection

Audio-WestlakeU/ATST-SED 15 Sep 2023

In this work, we study how to fine-tune pretrained models for SED.

39

Leveraging Geometrical Acoustic Simulations of Spatial Room Impulse Responses for Improved Sound Event Detection and Localization

ChrisIck/DCASE_Synth_Data 6 Sep 2023

As deeper and more complex models are developed for the task of sound event localization and detection (SELD), the demand for annotated spatial audio data continues to increase.

2