Search Results for author: Bunlong Lay

Found 6 papers, 3 papers with code

An Analysis of the Variance of Diffusion-based Speech Enhancement

no code implementations • 1 Feb 2024 • Bunlong Lay, Timo Gerkmann

The speech enhancement performance varies depending on the choice of the stochastic differential equation that controls the evolution of the mean and the variance along the diffusion processes when adding environmental and Gaussian noise.

Speech Enhancement

Paper
Add Code

Single and Few-step Diffusion for Generative Speech Enhancement

1 code implementation • 18 Sep 2023 • Bunlong Lay, Jean-Marie Lemercier, Julius Richter, Timo Gerkmann

While the performance of usual generative diffusion algorithms drops dramatically when lowering the number of function evaluations (NFEs) to obtain single-step diffusion, we show that our proposed method keeps a steady performance and therefore largely outperforms the diffusion baseline in this setting and also generalizes better than its predictive counterpart.

Denoising Speech Enhancement

Paper
Code

EMOCONV-DIFF: Diffusion-based Speech Emotion Conversion for Non-parallel and In-the-wild Data

no code implementations • 14 Sep 2023 • Navin Raj Prabhu, Bunlong Lay, Simon Welker, Nale Lehmann-Willenbrock, Timo Gerkmann

Subsequently, at inference, a target emotion embedding is employed to convert the emotion of the input utterance to the given target emotion.

Paper
Add Code

Speech Signal Improvement Using Causal Generative Diffusion Models

no code implementations • 15 Mar 2023 • Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Tal Peer, Timo Gerkmann

In this paper, we present a causal speech signal improvement system that is designed to handle different types of distortions.

Paper
Add Code

Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement

2 code implementations • 28 Feb 2023 • Bunlong Lay, Simon Welker, Julius Richter, Timo Gerkmann

Recently, score-based generative models have been successfully employed for the task of speech enhancement.

Speech Enhancement

385

Paper
Code

Speech Enhancement and Dereverberation with Diffusion-based Generative Models

1 code implementation • IEEE/ACM Transactions on Audio, Speech, and Language Processing 2023 • Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Timo Gerkmann

This matches our forward process which moves from clean speech to noisy speech by including a drift term.

Ranked #19 on Speech Enhancement on VoiceBank + DEMAND

Speech Dereverberation

385

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.