1 code implementation • 6 Aug 2018 • Yuanbo Hou, Qiuqiang Kong, Shengchen Li
To use the order information of sound events, we propose sequential labelled data (SLD), where both the presence or absence and the order information of sound events are known.
no code implementations • 27 Dec 2019 • Yusong Wu, Shengchen Li, Chengzhu Yu, Heng Lu, Chao Weng, Liqiang Zhang, Dong Yu
This paper presents a method that generates expressive singing voice of Peking opera.
no code implementations • 7 Aug 2020 • Yusong Wu, Shengchen Li, Chengzhu Yu, Heng Lu, Chao Weng, Liqiang Zhang, Dong Yu
In this work, we propose to deal with this issue and synthesize expressive Peking Opera singing from the music score based on the Duration Informed Attention Network (DurIAN) framework.
no code implementations • 7 Dec 2020 • Shengchen Li, Yinji Jing, György Fazekas
The aim of the dataset is to examine whether it is possible to distinguish computer generated melodies by learning the feature of generated melodies.
1 code implementation • 5 Aug 2021 • Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang
Automated audio captioning aims to use natural language to describe the content of audio data.
1 code implementation • 4 Aug 2022 • Jingyi Wang, Shengchen Li
The result also verified that the performance improvement for quantization and SIMD instruction.
no code implementations • 12 Aug 2022 • Peiran Yan, Shengchen Li
In this paper, a series of pre-trained models are investigated for the correlation between extracted audio features and the performance of audio captioning.
no code implementations • 17 Aug 2022 • Ruowei Xing, Shengchen Li
Analysing the different performance of these two methods, PYIN is applied to supplement the F0 extracted from the trained CNN model to combine the advantages of these two algorithms.
1 code implementation • 28 Oct 2022 • Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, Lilian H. Tang, Mark D. Plumbley, Volkan Kılıç, Wenwu Wang
Audio captioning aims to generate text descriptions of audio clips.
no code implementations • 31 Jan 2023 • Yuqiang Li, Shengchen Li, George Fazekas
Results display a general phenomenon of over-fitting from two aspects, the pitch embedding space and the test loss of the single-token grid encoding.