no code implementations • ROCLING 2022 • Chi-En Dai, Qi-Wei Hong, Jeih-weih Hung
In this study, we present revising the STFT loss in DEMUCS by employing the compressed magnitude spectrogram.
no code implementations • ROCLING 2022 • Yan-Tong Chen, Zong-Tai Wu, Jeih-weih Hung
Nowadays, time-domain features have been widely used in speech enhancement (SE) networks like frequency-domain features to achieve excellent performance in eliminating noise from input utterances.
no code implementations • ROCLING 2021 • Yan-Tong Chen, Zi-Qiang Lin, Jeih-weih Hung
Preliminary experiments conducted on a subset of TIMIT corpus reveal that the proposed method can make the resulting IRM achieve higher speech quality and intelligibility for the babble noise-corrupted signals compared with the original IRM, indicating that the lowpass filtered temporal feature sequence can learn a superior IRM network for speech enhancement.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 26 Aug 2021 • Fu-An Chao, Jeih-weih Hung, Berlin Chen
In recent decades, many studies have suggested that phase information is crucial for speech enhancement (SE), and time-domain single-channel speech enhancement techniques have shown promise in noise suppression and robust automatic speech recognition (ASR).
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
1 code implementation • 4 Jul 2021 • Fu-An Chao, Shao-Wei Fan Jiang, Bi-Cheng Yan, Jeih-weih Hung, Berlin Chen
Due to the unprecedented breakthroughs brought about by deep learning, speech enhancement (SE) techniques have been developed rapidly and play an important role prior to acoustic modeling to mitigate noise effects on speech.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 15 Nov 2020 • Yen-Ju Lu, Chia-Yu Chang, Cheng Yu, Ching-Feng Liu, Jeih-weih Hung, Shinji Watanabe, Yu Tsao
Experimental results from speech denoising, speech dereverberation, and impaired speech enhancement tasks confirmed that contextual BPC information improves SE performance.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+5
no code implementations • 13 Aug 2020 • Yen-Ju Lu, Chien-Feng Liao, Xugang Lu, Jeih-weih Hung, Yu Tsao
In noisy conditions, knowing speech contents facilitates listeners to more effectively suppress background noise components and to retrieve pure speech signals.
no code implementations • 22 Nov 2019 • Cheng Yu, Kuo-Hsuan Hung, Syu-Siang Wang, Szu-Wei Fu, Yu Tsao, Jeih-weih Hung
Previous studies have proven that integrating video signals, as a complementary modality, can facilitate improved performance for speech enhancement (SE).
no code implementations • 19 Nov 2019 • Syu-Siang Wang, Yu-You Liang, Jeih-weih Hung, Yu Tsao, Hsin-Min Wang, Shih-Hau Fang
Speech-related applications deliver inferior performance in complex noise environments.
1 code implementation • 8 Nov 2018 • Shih-kuang Lee, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung
The presented DWT-based SE method with various scaling factors for the detail part is evaluated with a subset of Aurora-2 database, and the PESQ metric is used to indicate the quality of processed speech signals.