Search Results for author: Toni Heittola

Found 13 papers, 6 papers with code

Positive and negative sampling strategies for self-supervised learning on audio-video data

1 code implementation5 Feb 2024 Shanshan Wang, Soumya Tripathy, Toni Heittola, Annamaria Mesaros

In Self-Supervised Learning (SSL), Audio-Visual Correspondence (AVC) is a popular task to learn deep audio and video features from large unlabeled datasets.

Self-Supervised Learning

Zero-Shot Audio Classification using Image Embeddings

no code implementations10 Jun 2022 Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen

The results show that the classification performance is highly sensitive to the semantic relation between test and training classes and textual and image embeddings can reach up to the semantic acoustic embeddings when the seen and unseen classes are semantically similar.

Audio Classification Zero-shot Audio Classification +1

Low-complexity acoustic scene classification in DCASE 2022 Challenge

no code implementations8 Jun 2022 Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen

The provided baseline system is a convolutional neural network which employs post-training quantization of parameters, resulting in 46. 5 K parameters, and 29. 23 million multiply-and-accumulate operations (MMACs).

Acoustic Scene Classification Classification +2

Sound Event Detection: A Tutorial

1 code implementation12 Jul 2021 Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley

The goal of automatic sound event detection (SED) methods is to recognize what is happening in an audio signal and when it is happening.

BIG-bench Machine Learning Event Detection +1

Low-complexity acoustic scene classification for multi-device audio: analysis of DCASE 2021 Challenge systems

1 code implementation28 May 2021 Irene Martín-Morató, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen

The most used techniques among the submissions were residual networks and weight quantization, with the top systems reaching over 70% accuracy, and log loss under 0. 8.

Acoustic Scene Classification Quantization +1

Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019

4 code implementations6 Sep 2020 Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen

A large-scale realistic dataset of spatialized sound events was generated for the challenge, to be used for training of learning-based approaches, and for evaluation of the submissions in an unlabeled subset.

Data Augmentation Sound Event Localization and Detection

Active Learning for Sound Event Detection

no code implementations12 Feb 2020 Shuyang Zhao, Toni Heittola, Tuomas Virtanen

Training with recordings as context outperforms training with only annotated segments.

Active Learning Change Point Detection +2

Acoustic Scene Classification: A Competition Review

no code implementations2 Aug 2018 Shayan Gharib, Honain Derrar, Daisuke Niizumi, Tuukka Senttula, Janne Tommola, Toni Heittola, Tuomas Virtanen, Heikki Huttunen

In this paper we study the problem of acoustic scene classification, i. e., categorization of audio sequences into mutually exclusive classes based on their spectral content.

Acoustic Scene Classification Classification +2

A multi-device dataset for urban acoustic scene classification

2 code implementations25 Jul 2018 Annamaria Mesaros, Toni Heittola, Tuomas Virtanen

This paper introduces the acoustic scene classification task of DCASE 2018 Challenge and the TUT Urban Acoustic Scenes 2018 dataset provided for the task, and evaluates the performance of a baseline system in the task.

Acoustic Scene Classification Classification +1

Sound Event Detection in Multichannel Audio Using Spatial and Harmonic Features

no code implementations7 Jun 2017 Sharath Adavanne, Giambattista Parascandolo, Pasi Pertilä, Toni Heittola, Tuomas Virtanen

In this paper, we propose the use of spatial and harmonic features in combination with long short term memory (LSTM) recurrent neural network (RNN) for automatic sound event detection (SED) task.

Event Detection Sound Event Detection

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection

1 code implementation21 Feb 2017 Emre Çakır, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen, Tuomas Virtanen

Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure.

Event Detection Sound Event Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.