no code implementations • 7 Jun 2015 • Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Yuan-ming Liou, Yen-chen Wu, Yen-Ju Lu, Hung-Yi Lee, Lin-shan Lee
The Multi-layered Acoustic Tokenizer (MAT) proposed in this work automatically discovers multiple sets of acoustic tokens from the given corpus.
no code implementations • 7 Sep 2015 • Cheng-Tao Chung, Chun-an Chan, Lin-shan Lee
This linguistic structure includes two-level (subword-like and word-like) acoustic patterns, the lexicon of word-like patterns in terms of subword-like patterns and the N-gram language model based on word-like patterns.
no code implementations • 7 Sep 2015 • Cheng-Tao Chung, Chun-an Chan, Lin-shan Lee
This paper presents a new approach for unsupervised Spoken Term Detection with spoken queries using multiple sets of acoustic patterns automatically discovered from the target corpus.
no code implementations • 7 Sep 2015 • Cheng-Tao Chung, Wei-Ning Hsu, Cheng-Yi Lee, Lin-shan Lee
This paper presents a novel approach for enhancing the multiple sets of acoustic patterns automatically discovered from a given corpus.
no code implementations • MediaEval 2015 Workshop 2015 • Cheng-Tao Chung, Yang-De Chen
This note serves as a documentation describing the methods the authors of this paper implemented for the Query by Example Search on Speech Task (QUESST) as a part of MediaEval 2015.
Ranked #58 on Keyword Spotting on QUESST
no code implementations • 1 Feb 2016 • Cheng-Tao Chung, Cheng-Yu Tsai, Hsiang-Hung Lu, Chia-Hsiang Liu, Hung-Yi Lee, Lin-shan Lee
The multiple sets of token labels are then used as the targets of a Multi-target Deep Neural Network (MDNN) trained on low-level acoustic features.
1 code implementation • 22 Mar 2017 • Yu-Hsuan Wang, Cheng-Tao Chung, Hung-Yi Lee
In this paper we analyze the gate activation signals inside the gated recurrent neural networks, and find the temporal structure of such signals is highly correlated with the phoneme boundaries.
no code implementations • 17 Jul 2017 • Cheng-Tao Chung, Cheng-Yu Tsai, Chia-Hsiang Liu, Lin-shan Lee
A Multi-granular Acoustic Tokenizer (MAT) was proposed for automatic discovery of multiple sets of acoustic tokens from the given corpus.
no code implementations • 28 Nov 2017 • Cheng-Tao Chung, Lin-shan Lee
In this paper, we compare two paradigms for unsupervised discovery of structured acoustic tokens directly from speech corpora without any human annotation.