Search Results for author: Chunyu Qiang

Found 6 papers, 0 papers with code

High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models

no code implementations • 27 Sep 2023 • Chunyu Qiang, Hao Li, Yixin Tian, Yi Zhao, Ying Zhang, Longbiao Wang, Jianwu Dang

To address these issues, we propose a minimally-supervised high-fidelity speech synthesis method in which all modules are built on diffusion models.

Speech Synthesis Voice Cloning

Learning Speech Representation From Contrastive Token-Acoustic Pretraining

no code implementations • 1 Sep 2023 • Chunyu Qiang, Hao Li, Yixin Tian, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

However, existing contrastive learning methods in the audio field focus on extracting global descriptive information for downstream audio classification tasks, making them unsuitable for TTS, VC, and ASR tasks.

Audio Classification Automatic Speech Recognition +5

Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding

no code implementations • 28 Jul 2023 • Chunyu Qiang, Hao Li, Hao Ni, He Qu, Ruibo Fu, Tao Wang, Longbiao Wang, Jianwu Dang

However, existing methods suffer from three problems: the high dimensionality and waveform distortion of discrete speech representations, the prosodic averaging problem caused by the duration prediction model in non-autoregressive frameworks, and the information redundancy and dimension explosion problems of existing semantic encoding methods.

Language Modelling Speech Synthesis

Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

no code implementations • 14 Mar 2023 • Chunyu Qiang, Peng Yang, Hao Che, Ying Zhang, Xiaorui Wang, Zhongyuan Wang

Cross-speaker style transfer in speech synthesis aims to transfer a style from a source speaker to synthesized speech in a target speaker's timbre.

Prosody Prediction Speech Synthesis +1

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

no code implementations • 13 Dec 2022 • Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang

To improve the style extraction ability of the reference encoder, a style-invariant and contrastive data augmentation method is proposed.

Data Augmentation Speech Synthesis +1

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

no code implementations • 17 Nov 2022 • Chunyu Qiang, Peng Yang, Hao Che, Jinba Xiao, Xiaorui Wang, Zhongyuan Wang

In this paper, we propose a simple back-translation-style data augmentation method for Mandarin Chinese polyphone disambiguation that uses a large amount of unlabeled text data.

Data Augmentation Machine Translation +3
