no code implementations • 7 Dec 2023 • Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May
To address this, we extend this framework to account for the progressive transformation between the clean and noisy speech signals.
no code implementations • 5 Dec 2023 • Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May
We show that the proposed system substantially benefits from using multiple databases for training, and achieves superior performance compared to state-of-the-art discriminative models in both matched and mismatched conditions.
no code implementations • 12 Sep 2023 • Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May
We find that for all models, the performance degrades the most in speech mismatches, while good noise and room generalization can be achieved by training on multiple databases.
no code implementations • 25 Jan 2023 • Philippe Gonzalez, Tommy Sonne Alstrøm, Tobias May
This paper systematically investigates the effect of different batching strategies and batch sizes on the training statistics and speech enhancement performance of a Conv-TasNet, evaluated in both matched and mismatched conditions.