Search Results for author: Xiaorui Wang

Found 16 papers, 7 papers with code

ChipSong: A Controllable Lyric Generation System for Chinese Popular Song

1 code implementation In2Writing (ACL) 2022 Nayu Liu, Wenjing Han, Guangcan Liu, Da Peng, Ran Zhang, Xiaorui Wang, Huabin Ruan

In this work, we take a further step towards satisfying practical demands in Chinese lyric generation from musical short-video creators, in respect of the challenges on songs’ format constraints, creating specific lyrics from open-ended inspiration inputs, and language rhyme grace.

Language Modelling Sentence

Filter Pruning via Filters Similarity in Consecutive Layers

no code implementations26 Apr 2023 Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie

Filter pruning is widely adopted to compress and accelerate the Convolutional Neural Networks (CNNs), but most previous works ignore the relationship between filters and channels in different layers.

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

no code implementations13 Dec 2022 Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang

In order to improve the style extraction ability of the reference encoder, a style invariant and contrastive data augmentation method is proposed.

Data Augmentation Speech Synthesis +1

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

no code implementations17 Nov 2022 Chunyu Qiang, Peng Yang, Hao Che, Jinba Xiao, Xiaorui Wang, Zhongyuan Wang

In this paper we propose a simple back-translation-style data augmentation method for mandarin Chinese polyphone disambiguation, utilizing a large amount of unlabeled text data.

Data Augmentation Machine Translation +3

Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition

no code implementations17 Sep 2022 Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang

Experimental results show that the proposed model achieves competitive performance with 1/3 of the parameters of the encoder, compared with the full-parameter model.

Knowledge Distillation speech-recognition +1

SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification

1 code implementation18 Sep 2021 Wentao Zhu, Tianlong Kong, Shun Lu, Jixiang Li, Dawei Zhang, Feng Deng, Xiaorui Wang, Sen yang, Ji Liu

Recently, x-vector has been a successful and popular approach for speaker verification, which employs a time delay neural network (TDNN) and statistics pooling to extract speaker characterizing embedding from variable-length utterances.

Neural Architecture Search Speaker Recognition +2

Dynamic Multi-scale Convolution for Dialect Identification

1 code implementation2 Aug 2021 Tianlong Kong, Shouyi Yin, Dawei Zhang, Wang Geng, Xin Wang, Dandan song, Jinwen Huang, Huiyu Shi, Xiaorui Wang

To address this issue, we propose a new architecture, named dynamic multi-scale convolution, which consists of dynamic kernel convolution, local multi-scale learning, and global multi-scale pooling.

Dialect Identification

Multi-Task Audio Source Separation

1 code implementation14 Jul 2021 Lu Zhang, Chenxing Li, Feng Deng, Xiaorui Wang

In detail, the proposed model follows a two-stage pipeline, which separates the three types of audio signals and then performs signal compensation separately.

Audio Source Separation Multi-task Audio Source Seperation +3

Gaussian Dynamic Convolution for Efficient Single-Image Segmentation

no code implementations18 Apr 2021 Xin Sun, Changrui Chen, Xiaorui Wang, Junyu Dong, Huiyu Zhou, Sheng Chen

Furthermore, we also build a Gaussian dynamic pyramid Pooling to show its potential and generality in common semantic segmentation.

Image Segmentation Segmentation +1

Improving Gated Recurrent Unit Based Acoustic Modeling with Batch Normalization and Enlarged Context

no code implementations26 Nov 2018 Jie Li, Yahui Shan, Xiaorui Wang, Yan Li

The use of future contextual information is typically shown to be helpful for acoustic modeling.

Gated Recurrent Unit Based Acoustic Modeling with Future Context

no code implementations18 May 2018 Jie Li, Xiaorui Wang, Yuan-Yuan Zhao, Yan Li

The use of future contextual information is typically shown to be helpful for acoustic modeling.

Cannot find the paper you are looking for? You can Submit a new open access paper.