Layer-wise Analysis of a Self-supervised Speech Representation Model

no code implementations • 10 Jul 2021 • Ankita Pasad, Ju-chieh Chou, Karen Livescu

Recently proposed self-supervised learning approaches have been successful for pre-training speech representation models.

One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization

10 code implementations • 10 Apr 2019 • Ju-chieh Chou, Cheng-chieh Yeh, Hung-Yi Lee

Recently, voice conversion (VC) without parallel data has been successfully adapted to the multi-target scenario, in which a single model is trained to convert the input voice to many different speakers.

Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences

1 code implementation • 9 Aug 2018 • Cheng-chieh Yeh, Po-chun Hsu, Ju-chieh Chou, Hung-Yi Lee, Lin-shan Lee

In this way, the length constraint mentioned above is removed to offer rhythm-flexible voice conversion without requiring parallel data.

Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations

4 code implementations • 9 Apr 2018 • Ju-chieh Chou, Cheng-chieh Yeh, Hung-Yi Lee, Lin-shan Lee

The decoder then takes the speaker-independent latent representation and the target speaker embedding as the input to generate the voice of the target speaker with the linguistic content of the source utterance.

Leveraging Linguistic Structures for Named Entity Recognition with Bidirectional Recursive Neural Networks

1 code implementation • EMNLP 2017 • Peng-Hsuan Li, Ruo-Ping Dong, Yu-Siang Wang, Ju-chieh Chou, Wei-Yun Ma

Motivated by the observation that named entities are highly related to linguistic constituents, we propose a constituent-based BRNN-CNN for named entity recognition.

