Search Results for author: Annamaria Mesaros

Found 19 papers, 6 papers with code

Sound Event Detection and Localization with Distance Estimation

no code implementations18 Mar 2024 Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros

Sound Event Detection and Localization (SELD) is a combined task of identifying sound events and their corresponding direction-of-arrival (DOA).

Event Detection Sound Event Detection

Positive and negative sampling strategies for self-supervised learning on audio-video data

1 code implementation5 Feb 2024 Shanshan Wang, Soumya Tripathy, Toni Heittola, Annamaria Mesaros

In Self-Supervised Learning (SSL), Audio-Visual Correspondence (AVC) is a popular task to learn deep audio and video features from large unlabeled datasets.

Self-Supervised Learning

Class-Incremental Learning for Multi-Label Audio Classification

no code implementations9 Jan 2024 Manjunath Mulimani, Annamaria Mesaros

Experiments are performed on a dataset with 50 sound classes, with an initial classification task containing 30 base classes and 4 incremental phases of 5 classes each.

Audio Classification Class Incremental Learning +2

Evaluating Classification Systems Against Soft Labels with Fuzzy Precision and Recall

no code implementations25 Sep 2023 Manu Harju, Annamaria Mesaros

Classification systems are normally trained by minimizing the cross-entropy between system outputs and reference labels, which makes the Kullback-Leibler divergence a natural choice for measuring how closely the system can follow the data.

Event Detection Sound Event Detection

Incremental Learning of Acoustic Scenes and Sound Events

no code implementations28 Feb 2023 Manjunath Mulimani, Annamaria Mesaros

At the same time, its performance on the previous ASC task decreases only by 5. 1 percentage points due to the additional learning of the AT task.

Acoustic Scene Classification Audio Tagging +3

Self-supervised learning of audio representations using angular contrastive loss

1 code implementation10 Nov 2022 Shanshan Wang, Soumya Tripathy, Annamaria Mesaros

To improve the discriminative ability of feature embeddings in SSL, we propose a new loss function called Angular Contrastive Loss (ACL), a linear combination of angular margin and contrastive loss.

Contrastive Learning Self-Supervised Learning

Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification

no code implementations13 Sep 2022 Daniel Aleksander Krause, Annamaria Mesaros

Sound event detection (SED) and Acoustic scene classification (ASC) are two widely researched audio tasks that constitute an important part of research on acoustic scene analysis.

Acoustic Scene Classification Event Detection +2

Low-complexity acoustic scene classification in DCASE 2022 Challenge

no code implementations8 Jun 2022 Irene Martín-Morató, Francesco Paissan, Alberto Ancilotto, Toni Heittola, Annamaria Mesaros, Elisabetta Farella, Alessio Brutti, Tuomas Virtanen

The provided baseline system is a convolutional neural network which employs post-training quantization of parameters, resulting in 46. 5 K parameters, and 29. 23 million multiply-and-accumulate operations (MMACs).

Acoustic Scene Classification Classification +2

Joint Direction and Proximity Classification of Overlapping Sound Events from Binaural Audio

no code implementations26 Jul 2021 Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros

Finally, we propose various ways of combining the proximity and direction estimation problems into a joint task providing temporal information about the onsets and offsets of the appearing sources.

Crowdsourcing strong labels for sound event detection

no code implementations26 Jul 2021 Irene Martín-Morató, Manu Harju, Annamaria Mesaros

Strong labels are a necessity for evaluation of sound event detection methods, but often scarcely available due to the high resources required by the annotation task.

Event Detection Sound Event Detection

Sound Event Detection: A Tutorial

1 code implementation12 Jul 2021 Annamaria Mesaros, Toni Heittola, Tuomas Virtanen, Mark D. Plumbley

The goal of automatic sound event detection (SED) methods is to recognize what is happening in an audio signal and when it is happening.

BIG-bench Machine Learning Event Detection +1

Low-complexity acoustic scene classification for multi-device audio: analysis of DCASE 2021 Challenge systems

1 code implementation28 May 2021 Irene Martín-Morató, Toni Heittola, Annamaria Mesaros, Tuomas Virtanen

The most used techniques among the submissions were residual networks and weight quantization, with the top systems reaching over 70% accuracy, and log loss under 0. 8.

Acoustic Scene Classification Quantization +1

Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019

4 code implementations6 Sep 2020 Archontis Politis, Annamaria Mesaros, Sharath Adavanne, Toni Heittola, Tuomas Virtanen

A large-scale realistic dataset of spatialized sound events was generated for the challenge, to be used for training of learning-based approaches, and for evaluation of the submissions in an unlabeled subset.

Data Augmentation Sound Event Localization and Detection

A multi-device dataset for urban acoustic scene classification

2 code implementations25 Jul 2018 Annamaria Mesaros, Toni Heittola, Tuomas Virtanen

This paper introduces the acoustic scene classification task of DCASE 2018 Challenge and the TUT Urban Acoustic Scenes 2018 dataset provided for the task, and evaluates the performance of a baseline system in the task.

Acoustic Scene Classification Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.