Browse > Audio > Sound Event Detection

Sound Event Detection

4 papers with code · Audio

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

Adaptive pooling operators for weakly labeled sound event detection

26 Apr 2018marl/autopool

In this work, we treat SED as a multiple instance learning (MIL) problem, where training labels are static over a short excerpt, indicating the presence or absence of sound sources but not their temporal locality. We evaluate the proposed pooling operators on three datasets, and demonstrate that in each case, the proposed methods outperform non-adaptive pooling operators for static prediction, and nearly match the performance of models trained with strong, dynamic annotations.

MULTIPLE INSTANCE LEARNING SOUND EVENT DETECTION TIME SERIES

A Closer Look at Weak Label Learning for Audio Events

24 Apr 2018ankitshah009/WALNet-Weak_Label_Analysis

In this work, we first describe a CNN based approach for weakly supervised training of audio events. We then describe important characteristics, which naturally arise in weakly supervised learning of sound events.

AUDIO CLASSIFICATION SOUND EVENT DETECTION

Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings

4 Apr 2016yardencsGitHub/tweetynet

In this paper we present an approach to polyphonic sound event detection in real life recordings based on bi-directional long short term memory (BLSTM) recurrent neural networks (RNNs). A single multilabel BLSTM RNN is trained to map acoustic features of a mixture signal consisting of sounds from multiple classes, to binary activity indicators of each event class.

SOUND EVENT DETECTION

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection

21 Feb 2017cchinchristopherj/Right-Whale-Unsupervised-Model

Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure. Convolutional neural networks (CNN) are able to extract higher level features that are invariant to local spectral and temporal variations.

SOUND EVENT DETECTION