1 code implementation • In2Writing (ACL) 2022 • Nayu Liu, Wenjing Han, Guangcan Liu, Da Peng, Ran Zhang, Xiaorui Wang, Huabin Ruan
In this work, we take a further step towards satisfying practical demands in Chinese lyric generation from musical short-video creators, in respect of the challenges on songs’ format constraints, creating specific lyrics from open-ended inspiration inputs, and language rhyme grace.
no code implementations • 26 Apr 2023 • Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie
Filter pruning is widely adopted to compress and accelerate the Convolutional Neural Networks (CNNs), but most previous works ignore the relationship between filters and channels in different layers.
no code implementations • 14 Mar 2023 • Chunyu Qiang, Peng Yang, Hao Che, Ying Zhang, Xiaorui Wang, Zhongyuan Wang
Cross-speaker style transfer in speech synthesis aims at transferring a style from source speaker to synthesized speech of a target speaker's timbre.
no code implementations • 13 Dec 2022 • Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang
In order to improve the style extraction ability of the reference encoder, a style invariant and contrastive data augmentation method is proposed.
no code implementations • 17 Nov 2022 • Chunyu Qiang, Peng Yang, Hao Che, Jinba Xiao, Xiaorui Wang, Zhongyuan Wang
In this paper we propose a simple back-translation-style data augmentation method for mandarin Chinese polyphone disambiguation, utilizing a large amount of unlabeled text data.
no code implementations • 17 Sep 2022 • Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang
Experimental results show that the proposed model achieves competitive performance with 1/3 of the parameters of the encoder, compared with the full-parameter model.
1 code implementation • Nature Machine Intelligence 2022 • Yuquan Li, Chang-Yu Hsieh, Ruiqiang Lu, Xiaoqing Gong, Xiaorui Wang, Pengyong Li, Shuo Liu, Yanan Tian, Dejun Jiang, Jiaxian Yan, Qifeng Bai, Huanxiang Liu, Shengyu Zhang, Xiaojun Yao
In fact, the pursuit of high prediction performance on a limited number of datasets has crystallized their architectures and hyperparameters, making them lose advantage in repurposing to new data generated in drug discovery.
Ranked #1 on Drug Discovery on ToxCast (Toxicity Forecaster)
1 code implementation • 18 Sep 2021 • Wentao Zhu, Tianlong Kong, Shun Lu, Jixiang Li, Dawei Zhang, Feng Deng, Xiaorui Wang, Sen yang, Ji Liu
Recently, x-vector has been a successful and popular approach for speaker verification, which employs a time delay neural network (TDNN) and statistics pooling to extract speaker characterizing embedding from variable-length utterances.
Ranked #1 on Speaker Verification on VoxCeleb1
1 code implementation • Chemical Engineering Journal 2021 • Xiaorui Wang, Yuquan Li, Jiezhong Qiu, Guangyong Chen, Huanxiang Liu, Benben Liao, Chang-Yu Hsieh, Xiaojun Yaoa
RetroPrime achieves the Top-1 accuracy of 64. 8% and 51. 4%, when the reaction type is known and unknown, respectively, in the USPTO-50 K dataset.
Ranked #12 on Single-step retrosynthesis on USPTO-50k
1 code implementation • 2 Aug 2021 • Tianlong Kong, Shouyi Yin, Dawei Zhang, Wang Geng, Xin Wang, Dandan song, Jinwen Huang, Huiyu Shi, Xiaorui Wang
To address this issue, we propose a new architecture, named dynamic multi-scale convolution, which consists of dynamic kernel convolution, local multi-scale learning, and global multi-scale pooling.
1 code implementation • 14 Jul 2021 • Lu Zhang, Chenxing Li, Feng Deng, Xiaorui Wang
In detail, the proposed model follows a two-stage pipeline, which separates the three types of audio signals and then performs signal compensation separately.
Ranked #1 on Multi-task Audio Source Seperation on MTASS
Audio Source Separation Multi-task Audio Source Seperation +3
no code implementations • 18 Apr 2021 • Xin Sun, Changrui Chen, Xiaorui Wang, Junyu Dong, Huiyu Zhou, Sheng Chen
Furthermore, we also build a Gaussian dynamic pyramid Pooling to show its potential and generality in common semantic segmentation.
no code implementations • 29 Oct 2019 • Xinyong Zhou, Hao Che, Xiaorui Wang, Lei Xie
In this paper, we present a cross-lingual voice cloning approach.
1 code implementation • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019 • YuanYuan Zhao, Jie Li, Xiaorui Wang, Yan Li
Attention-based sequence-to-sequence architectures have made great progress in the speech recognition task.
no code implementations • 26 Nov 2018 • Jie Li, Yahui Shan, Xiaorui Wang, Yan Li
The use of future contextual information is typically shown to be helpful for acoustic modeling.
no code implementations • 18 May 2018 • Jie Li, Xiaorui Wang, Yuan-Yuan Zhao, Yan Li
The use of future contextual information is typically shown to be helpful for acoustic modeling.