Search Results for author: Hannes Gamper

Found 15 papers, 11 papers with code

Rethinking Emotion Bias in Music via Frechet Audio Distance

1 code implementation23 Sep 2024 Yuanchao Li, Azalea Gui, Dimitra Emmanouilidou, Hannes Gamper

In this work, we conduct a study on Music Emotion Recognition (MER) and Emotional Music Generation (EMG), employing diverse audio encoders alongside the Frechet Audio Distance (FAD), a reference-free evaluation metric.

Emotion Recognition FAD +2

Multi-label audio classification with a noisy zero-shot teacher

1 code implementation20 Jul 2024 Sebastian Braun, Hannes Gamper

We propose a novel training scheme using self-label correction and data augmentation methods designed to deal with noisy labels and improve real-world accuracy on a polyphonic audio content detection task.

Audio Classification Data Augmentation

Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data

1 code implementation29 May 2024 Eloi Moliner, Sebastian Braun, Hannes Gamper

This paper investigates the potential of Gaussian Flow Bridges, an emerging approach in generative modeling, for this problem.

continuous-control Continuous Control

PAM: Prompting Audio-Language Models for Audio Quality Assessment

1 code implementation1 Feb 2024 Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang

Here, we exploit this capability and introduce PAM, a no-reference metric for assessing audio quality for different audio processing tasks.

Audio Quality Assessment Music Generation +2

CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling

no code implementations8 Dec 2023 Ruihan Yang, Hannes Gamper, Sebastian Braun

We introduce a multi-modal diffusion model tailored for the bi-directional conditional generation of video and audio.

Audio Generation

Adapting Frechet Audio Distance for Generative Music Evaluation

3 code implementations2 Nov 2023 Azalea Gui, Hannes Gamper, Sebastian Braun, Dimitra Emmanouilidou

The growing popularity of generative music models underlines the need for perceptually relevant, objective music quality metrics.

FAD

ICASSP 2023 Acoustic Echo Cancellation Challenge

1 code implementation22 Sep 2023 Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Evgenii Indenbom, Nicolae-Catalin Ristea, Jegor Gužvin, Hannes Gamper, Sebastian Braun, Robert Aichner

This is the fourth AEC challenge and it is enhanced by adding a second track for personalized acoustic echo cancellation, reducing the algorithmic + buffering latency to 20ms, as well as including a full-band version of AECMOS.

Acoustic echo cancellation Speech Enhancement

Speech MOS multi-task learning and rater bias correction

no code implementations4 Dec 2022 Haleh Akrami, Hannes Gamper

The mean opinion score (MOS) is standardized for the perceptual evaluation of speech quality and is obtained by asking listeners to rate the quality of a speech sample.

Multi-Task Learning

ICASSP 2022 Deep Noise Suppression Challenge

1 code implementation27 Feb 2022 Harishchandra Dubey, Vishak Gopal, Ross Cutler, Ashkan Aazami, Sergiy Matusevych, Sebastian Braun, Sefik Emre Eskimez, Manthan Thakker, Takuya Yoshioka, Hannes Gamper, Robert Aichner

We open-source datasets and test sets for researchers to train their deep noise suppression models, as well as a subjective evaluation framework based on ITU-T P. 835 to rate and rank-order the challenge entries.

ICASSP 2022 Acoustic Echo Cancellation Challenge

1 code implementation27 Feb 2022 Ross Cutler, Ando Saabas, Tanel Parnamaa, Marju Purin, Hannes Gamper, Sebastian Braun, Karsten Sørensen, Robert Aichner

This is the third AEC challenge and it is enhanced by including mobile scenarios, adding speech recognition rate in the challenge goal metrics, and making the default sample rate 48 kHz.

Acoustic echo cancellation Speech Enhancement +2

Effect of noise suppression losses on speech distortion and ASR performance

no code implementations23 Nov 2021 Sebastian Braun, Hannes Gamper

Deep learning based speech enhancement has made rapid development towards improving quality, while models are becoming more compact and usable for real-time on-the-edge inference.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Towards efficient models for real-time deep noise suppression

no code implementations22 Jan 2021 Sebastian Braun, Hannes Gamper, Chandan K. A. Reddy, Ivan Tashev

It is shown that the achievable speech quality is a function of network complexity, and show which models have better tradeoffs.

Speech Enhancement

Interspeech 2021 Deep Noise Suppression Challenge

2 code implementations6 Jan 2021 Chandan K A Reddy, Harishchandra Dubey, Kazuhito Koishida, Arun Nair, Vishak Gopal, Ross Cutler, Sebastian Braun, Hannes Gamper, Robert Aichner, Sriram Srinivasan

In this version of the challenge organized at INTERSPEECH 2021, we are expanding both our training and test datasets to accommodate full band scenarios.

Denoising

ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets and Testing Framework

1 code implementation10 Sep 2020 Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Parnamaa, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan

In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios.

Acoustic echo cancellation Audio and Speech Processing Sound

Fast acoustic scattering using convolutional neural networks

1 code implementation30 Oct 2019 Ziqi Fan, Vibhav Vineet, Hannes Gamper, Nikunj Raghuvanshi

Diffracted scattering and occlusion are important acoustic effects in interactive auralization and noise control applications, typically requiring expensive numerical simulation.

Image-to-Image Regression regression

Cannot find the paper you are looking for? You can Submit a new open access paper.