no code implementations • 28 Nov 2020 • Man-Ling Sung, Tan Lee
The Siamese/Triplet network is trained on the hypothesized examples to measure the similarity between two speech segments and thereby perform re-clustering of all hypothesized subword sequences to achieve spoken term discovery.
no code implementations • 3 Nov 2020 • Man-Ling Sung, Siyuan Feng, Tan Lee
With acoustic models trained without supervision, a given audio archive is represented by a pseudo transcription, from which spoken keywords can be discovered by string mining algorithms.
Automatic Speech Recognition (ASR)
no code implementations • 18 Sep 2019 • Herbert Gish, Jan Silovsky, Man-Ling Sung, Man-Hung Siu, William Hartmann, Zhuolin Jiang
This includes results about the ability of the noisy model to make the same decisions as the clean model and the effects of noise on model performance.