1 code implementation • 31 Oct 2020 • Yen-Hao Chen, Da-Yi Wu, Tsung-Han Wu, Hung-Yi Lee
With a proper activation as an information bottleneck on content embeddings, the trade-off between the synthesis quality and the speaker similarity of the converted speech is improved drastically.
Audio and Speech Processing Sound
no code implementations • 9 Jun 2020 • Tsung-Han Wu, Chun-Chen Hsieh, Yen-Hao Chen, Po-Han Chi, Hung-Yi Lee
In this paper, we seek solutions for reducing the computation complexity of transformer-based models for speech representation learning.
1 code implementation • 7 Jun 2020 • Da-Yi Wu, Yen-Hao Chen, Hung-Yi Lee
Voice conversion (VC) is a task that transforms the source speaker's timbre, accent, and tones in audio into another one's while preserving the linguistic content.
4 code implementations • 18 May 2020 • Po-Han Chi, Pei-Hung Chung, Tsung-Han Wu, Chun-Cheng Hsieh, Yen-Hao Chen, Shang-Wen Li, Hung-Yi Lee
We use the representations with two downstream tasks, speaker identification, and phoneme classification.