no code implementations • 12 May 2023 • Peter Plantinga, Jaekwon Yoo, Chandra Dhir
Our experiments show that a simple linear interpolation of several models' parameters, each fine-tuned from the same generalist model, results in a single model that performs well on all tested data.
4 code implementations • 8 Jun 2021 • Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato de Mori, Yoshua Bengio
SpeechBrain is an open-source and all-in-one speech toolkit.
3 code implementations • 8 Apr 2021 • Szu-Wei Fu, Cheng Yu, Tsun-An Hsieh, Peter Plantinga, Mirco Ravanelli, Xugang Lu, Yu Tsao
The discrepancy between the cost function used for training a speech enhancement model and human auditory perception usually makes the quality of enhanced speech unsatisfactory.
Ranked #9 on
Speech Enhancement
on VoiceBank + DEMAND
no code implementations • 3 Mar 2020 • Peter Plantinga, Eric Fosler-Lussier
The other loss term uses a uni-directional model as teacher model to align the bi-directional model.
1 code implementation • 3 Mar 2020 • Peter Plantinga, Deblin Bagchi, Eric Fosler-Lussier
While deep learning systems have gained significant ground in speech enhancement research, these systems have yet to make use of the full potential of deep learning systems to provide high-level feedback.
1 code implementation • 25 Sep 2018 • Peter Plantinga, Deblin Bagchi, Eric Fosler-Lussier
Spectral mapping uses a deep neural network (DNN) to map directly from noisy speech to clean speech.
Sound Audio and Speech Processing
no code implementations • 26 Mar 2018 • Deblin Bagchi, Peter Plantinga, Adam Stiff, Eric Fosler-Lussier
For the task of speech enhancement, local learning objectives are agnostic to phonetic structures helpful for speech recognition.