no code implementations • 1 Feb 2024 • Bunlong Lay, Timo Gerkmann
The speech enhancement performance varies depending on the choice of the stochastic differential equation that controls the evolution of the mean and the variance along the diffusion processes when adding environmental and Gaussian noise.
1 code implementation • 18 Sep 2023 • Bunlong Lay, Jean-Marie Lemercier, Julius Richter, Timo Gerkmann
While the performance of usual generative diffusion algorithms drops dramatically when lowering the number of function evaluations (NFEs) to obtain single-step diffusion, we show that our proposed method keeps a steady performance and therefore largely outperforms the diffusion baseline in this setting and also generalizes better than its predictive counterpart.
no code implementations • 14 Sep 2023 • Navin Raj Prabhu, Bunlong Lay, Simon Welker, Nale Lehmann-Willenbrock, Timo Gerkmann
Subsequently, at inference, a target emotion embedding is employed to convert the emotion of the input utterance to the given target emotion.
no code implementations • 15 Mar 2023 • Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Tal Peer, Timo Gerkmann
In this paper, we present a causal speech signal improvement system that is designed to handle different types of distortions.
2 code implementations • 28 Feb 2023 • Bunlong Lay, Simon Welker, Julius Richter, Timo Gerkmann
Recently, score-based generative models have been successfully employed for the task of speech enhancement.
1 code implementation • IEEE/ACM Transactions on Audio, Speech, and Language Processing 2023 • Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Timo Gerkmann
This matches our forward process which moves from clean speech to noisy speech by including a drift term.
Ranked #19 on Speech Enhancement on VoiceBank + DEMAND