Sound Event Detection

74 papers with code • 4 benchmarks • 18 datasets

Sound Event Detection (SED) is the task of recognizing the sound events in a recording together with their respective temporal start and end times. Sound events in real life do not always occur in isolation, but tend to overlap considerably with each other. Recognizing such overlapping sound events is referred to as polyphonic SED.

Source: A report on sound event detection with different binaural features
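
The definition above can be made concrete with a small sketch. It assumes a common (but not universal) representation in which a system outputs a binary frame-by-class activity matrix, and turns it into a list of events with onset and offset times. The function name `activity_to_events`, the class labels, and the 0.5 s frame hop are illustrative assumptions, not taken from any of the listed papers.

```python
from typing import List, Tuple

Event = Tuple[str, float, float]  # (label, onset in seconds, offset in seconds)


def activity_to_events(activity: List[List[int]],
                       class_names: List[str],
                       hop_seconds: float) -> List[Event]:
    """Convert a binary frame-by-class activity matrix into a list of events.

    activity[t][c] is 1 when class c is active in frame t (hypothetical layout).
    """
    events: List[Event] = []
    for c, label in enumerate(class_names):
        start = None  # index of the first frame of the currently open event
        for t, frame in enumerate(activity):
            if frame[c] and start is None:
                start = t  # event begins
            elif not frame[c] and start is not None:
                events.append((label, start * hop_seconds, t * hop_seconds))
                start = None  # event ends
        if start is not None:  # event still active at the end of the recording
            events.append((label, start * hop_seconds, len(activity) * hop_seconds))
    # Sort by onset time, then label, for a readable listing.
    return sorted(events, key=lambda e: (e[1], e[0]))


# Toy example: 6 frames, 2 classes, 0.5 s per frame (all values are made up).
activity = [
    [1, 0],
    [1, 1],
    [1, 1],
    [0, 1],
    [0, 0],
    [1, 0],
]
events = activity_to_events(activity, ["speech", "dog_bark"], hop_seconds=0.5)
for label, onset, offset in events:
    print(f"{label}: {onset:.1f} s to {offset:.1f} s")
```

In this toy input the first `speech` event and the `dog_bark` event overlap between 0.5 s and 1.5 s, which is the polyphonic case: each class is tracked independently, so simultaneous events are reported separately.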

Benchmarks

These leaderboards are used to track progress in Sound Event Detection

Dataset	Best Model
DESED	ATST-SED
L3DAS21	PHC SEDnet n=2
Mivia Audio Events	DENet
Mivia Road Events	DENet

Libraries

Use these libraries to find Sound Event Detection models and implementations

Kikyo-16/Sound_event_detection

3 papers

119

fgnt/sed_scores_eval

2 papers

Datasets

Most implemented papers

Guided learning for weakly-labeled semi-supervised sound event detection

Kikyo-16/Sound_event_detection • 6 Jun 2019

Instead of designing a single model by considering a trade-off between the two sub-targets, we design a teacher model aiming at audio tagging to guide a student model aiming at boundary detection to learn using the unlabeled data.

Evaluation of post-processing algorithms for polyphonic sound event detection

topel/dcase19-RCNN-task4 • 17 Jun 2019

We compared post-processing algorithms on the temporal prediction curves of two models: one based on the challenge's baseline and a Multiple Instance Learning (MIL) model.
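
As a rough illustration of what post-processing a temporal prediction curve can look like, the sketch below applies median filtering followed by a fixed threshold to a per-frame probability curve for one class. This is one widely used combination and is an assumption here; it is not claimed to be among the algorithms compared in the paper. The helper names, window size, threshold, and probability values are all hypothetical, and the curve is assumed to be non-empty.

```python
from statistics import median
from typing import List


def median_smooth(curve: List[float], window: int) -> List[float]:
    """Median-filter a non-empty curve with an odd window.

    Edge values are repeated as padding so the output has the same length.
    """
    half = window // 2
    padded = [curve[0]] * half + list(curve) + [curve[-1]] * half
    return [median(padded[i:i + window]) for i in range(len(curve))]


def binarize(curve: List[float], threshold: float) -> List[int]:
    """Mark a frame as active when its probability reaches the threshold."""
    return [1 if p >= threshold else 0 for p in curve]


# Made-up frame-level probabilities for a single sound class.
probs = [0.1, 0.9, 0.2, 0.8, 0.9, 0.7, 0.1, 0.6, 0.1, 0.1]

raw = binarize(probs, 0.5)                         # thresholding only
smoothed = binarize(median_smooth(probs, 3), 0.5)  # median filter, then threshold

print("raw:     ", raw)
print("smoothed:", smoothed)
```

Thresholding alone yields three fragmented active segments on this toy curve, while smoothing first merges them into a single contiguous segment. The window length and threshold trade off fragment removal against boundary accuracy and would normally be tuned per class on validation data.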

Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling

dr-costas/SEDLM • Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) 2019

For the TUT-SED Synthetic 2016 dataset, by contrast, our method yields a decrease of 4% in F1 score and an increase of 7% in error rate (ER).

City classification from multiple real-world sound scenes

drylbear/soundscapeCityClassification • 29 Jul 2019

In this paper, we undertake the task of automatic city classification to ask whether we can recognize a city from a set of sound scenes.

Guided Learning Convolution System for DCASE 2019 Task 4

Kikyo-16/Sound_event_detection • 11 Sep 2019

In this paper, we describe in detail the system we submitted to DCASE2019 task 4: sound event detection (SED) in domestic environments.

Musical Instrument Playing Technique Detection Based on FCN: Using Chinese Bowed-Stringed Instrument as an Example

water45wzh/MIPTD_Erhu • 20 Oct 2019

The effectiveness of the proposed framework is tested on a new dataset whose categorization of techniques is similar to that of our training dataset.

Sound event detection in domestic environments with weakly labeled data and soundscape synthesis

turpaultn/DCASE2019_task4 • 26 Oct 2019

This paper presents Task 4 of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 challenge and provides a first analysis of the challenge results.

Sound Event Detection with Depthwise Separable and Dilated Convolutions

dr-costas/dnd-sed • 2 Feb 2020

The number of channels of the CNNs and the size of the weight matrices of the RNNs have a direct effect on the total number of parameters of the SED method, which amounts to a couple of million.

Memory Controlled Sequential Self Attention for Sound Recognition

arjunp17/MemoryControlled-MultiheadSelfAtt • 13 May 2020

In this paper we investigate the importance of the extent of memory in sequential self attention for sound recognition.

Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection

soham97/MTL_Weakly_labelled_audio_data • 17 Aug 2020

Weakly labelled learning has garnered a lot of attention in recent years due to its potential to scale Sound Event Detection (SED), and is formulated as a Multiple Instance Learning (MIL) problem.
