Search Results for author: Jiaen Liang

Found 7 papers, 1 papers with code

Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis

no code implementations5 Jun 2023 Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin

Regressive Text-to-Speech (TTS) system utilizes attention mechanism to generate alignment between text and acoustic feature sequence.

Sentence Speech Synthesis

M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis

no code implementations3 May 2023 Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, JianHua Tao, Jianqing Sun, Jiaen Liang

However, it is still a challenge to comprehensively model the conversation, and a majority of conversational TTS systems only focus on extracting global information and omit local prosody features, which contain important fine-grained information like keywords and emphasis.

Speech Synthesis Text-To-Speech Synthesis

ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis

1 code implementation20 Mar 2022 Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang

In recent years, neural network based methods for multi-speaker text-to-speech synthesis (TTS) have made significant progress.

Speaker Verification Speech Synthesis +1

Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection

no code implementations4 Mar 2022 Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang

In recent years, exploring effective sound separation (SSep) techniques to improve overlapping sound event detection (SED) attracts more and more attention.

Event Detection Sound Event Detection

CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier

no code implementations26 Mar 2021 Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang

Domain mismatch is a noteworthy issue in acoustic event detection tasks, as the target domain data is difficult to access in most real applications.

Event Detection

Joint framework with deep feature distillation and adaptive focal loss for weakly supervised audio tagging and acoustic event detection

no code implementations23 Mar 2021 Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang, Yuping Wang

A good joint training framework is very helpful to improve the performances of weakly supervised audio tagging (AT) and acoustic event detection (AED) simultaneously.

Audio Tagging Event Detection

Attention-based scaling adaptation for target speech extraction

no code implementations19 Oct 2020 Jiangyu Han, Wei Rao, Yanhua Long, Jiaen Liang

Furthermore, by introducing a mixture embedding matrix pooling method, our proposed attention-based scaling adaptation (ASA) can exploit the target speaker clues in a more efficient way.

Speech Extraction

Cannot find the paper you are looking for? You can Submit a new open access paper.