no code implementations • 8 Jun 2023 • Zhiyi Wang, Shaoguang Mao, Wenshan Wu, Yan Xia, Yan Deng, Jonathan Tien
To leverage NLP models, speech input is first force-aligned with texts, and then pre-processed into a token sequence, including words and phrase break information.
no code implementations • 8 Jun 2021 • Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He
Experimental results obtained by the Transformer TTS show that the proposed BERT can extract fine-grained, segment-level prosody, which is complementary to utterance-level prosody to improve the final prosody of the TTS speech.
no code implementations • 8 Apr 2021 • Fengpeng Yue, Yan Deng, Lei He, Tom Ko
Machine Speech Chain, which integrates both end-to-end (E2E) automatic speech recognition (ASR) and text-to-speech (TTS) into one circle for joint training, has been proven to be effective in data augmentation by leveraging large amounts of unpaired data.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • 3 Jun 2019 • Mutian He, Yan Deng, Lei He
In this paper, we propose a novel stepwise monotonic attention method in sequence-to-sequence acoustic modeling to improve the robustness on out-of-domain inputs.
no code implementations • 13 Dec 2018 • Yan Deng, Lei He, Frank Soong
Neural TTS has shown it can generate high quality synthesized speech.
no code implementations • CVPR 2014 • Yan Deng, Anand Rangarajan, Stephan Eisenschenk, Baba C. Vemuri
In this paper, we use the well known Riemannian framework never before used for point cloud matching, and present a novel matching algorithm.