1 code implementation • 9 Jul 2020 • Emre Çakır, Konstantinos Drossos, Tuomas Virtanen
Audio captioning is a multi-modal task, focusing on using natural language for describing the contents of general audio.
1 code implementation • 17 Aug 2018 • Shayan Gharib, Konstantinos Drossos, Emre Çakır, Dmitriy Serdyuk, Tuomas Virtanen
A general problem in acoustic scene classification task is the mismatched conditions between training and testing data, which significantly reduces the performance of the developed methods on classification accuracy.
no code implementations • 9 May 2018 • Emre Çakır, Tuomas Virtanen
Sound event detection systems typically consist of two stages: extracting hand-crafted features from the raw audio waveform, and learning a mapping between these features and the target sound events using a classifier.
no code implementations • 7 Jun 2017 • Sharath Adavanne, Konstantinos Drossos, Emre Çakır, Tuomas Virtanen
This paper studies the detection of bird calls in audio segments using stacked convolutional and recurrent neural networks.
1 code implementation • 21 Feb 2017 • Emre Çakır, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen, Tuomas Virtanen
Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure.