17 May 2019 • Vincent Wan, Chun-an Chan, Tom Kenter, Jakub Vit, Rob Clark
The prosodic aspects of speech signals produced by current text-to-speech systems are typically averaged over training material, and as such lack the variety and liveliness found in natural speech.
7 Sep 2015 • Cheng-Tao Chung, Chun-an Chan, Lin-shan Lee
This linguistic structure includes two-level (subword-like and word-like) acoustic patterns, a lexicon that defines word-like patterns in terms of subword-like patterns, and an N-gram language model over word-like patterns.
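To make the three components named in this abstract concrete, here is a minimal sketch of how they could be represented. It is not the authors' method: the pattern IDs (`W1`, `s3`, ...), the toy utterances, and the helper names `bigram_model` and `expand` are all hypothetical, and a plain maximum-likelihood bigram stands in for the N-gram language model.

```python
from collections import Counter

# Hypothetical lexicon: each word-like pattern is spelled out as a
# sequence of subword-like patterns (IDs are made up for illustration).
lexicon = {
    "W1": ["s3", "s7"],
    "W2": ["s7", "s1", "s4"],
    "W3": ["s2"],
}

# Hypothetical corpus, already decoded into word-like pattern IDs.
utterances = [["W1", "W2", "W3"], ["W1", "W3"], ["W3", "W1", "W2"]]


def bigram_model(utterances):
    """Maximum-likelihood bigram probabilities P(next | prev) over word-like patterns."""
    pair_counts = Counter()
    prev_counts = Counter()
    for utt in utterances:
        for prev, nxt in zip(utt, utt[1:]):
            pair_counts[(prev, nxt)] += 1
            prev_counts[prev] += 1
    return {pair: c / prev_counts[pair[0]] for pair, c in pair_counts.items()}


def expand(utt, lexicon):
    """Rewrite a word-like sequence as its subword-like sequence via the lexicon."""
    return [sub for word in utt for sub in lexicon[word]]


lm = bigram_model(utterances)
subword_sequence = expand(["W1", "W3"], lexicon)
```

The lexicon links the two levels of patterns, while the language model is estimated only over the word-like level, which mirrors the separation described in the abstract.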
7 Sep 2015 • Cheng-Tao Chung, Chun-an Chan, Lin-shan Lee
This paper presents a new approach for unsupervised Spoken Term Detection with spoken queries using multiple sets of acoustic patterns automatically discovered from the target corpus.