Search Results for author: Shengchen Li

Found 10 papers, 4 papers with code

Audio Tagging With Connectionist Temporal Classification Model Using Sequential Labelled Data

1 code implementation6 Aug 2018 Yuanbo Hou, Qiuqiang Kong, Shengchen Li

To use the order information of sound events, we propose sequential labelled data (SLD), where both the presence or absence and the order information of sound events are known.

Audio Tagging General Classification

Peking Opera Synthesis via Duration Informed Attention Network

no code implementations7 Aug 2020 Yusong Wu, Shengchen Li, Chengzhu Yu, Heng Lu, Chao Weng, Liqiang Zhang, Dong Yu

In this work, we propose to deal with this issue and synthesize expressive Peking Opera singing from the music score based on the Duration Informed Attention Network (DurIAN) framework.

Singing Voice Synthesis

A novel dataset for the identification of computer generated melodies in the CSMT challenge

no code implementations7 Dec 2020 Shengchen Li, Yinji Jing, György Fazekas

The aim of the dataset is to examine whether it is possible to distinguish computer generated melodies by learning the feature of generated melodies.

An investigation on selecting audio pre-trained models for audio captioning

no code implementations12 Aug 2022 Peiran Yan, Shengchen Li

In this paper, a series of pre-trained models are investigated for the correlation between extracted audio features and the performance of audio captioning.

Audio captioning

Extract fundamental frequency based on CNN combined with PYIN

no code implementations17 Aug 2022 Ruowei Xing, Shengchen Li

Analysing the different performance of these two methods, PYIN is applied to supplement the F0 extracted from the trained CNN model to combine the advantages of these two algorithms.

An Comparative Analysis of Different Pitch and Metrical Grid Encoding Methods in the Task of Sequential Music Generation

no code implementations31 Jan 2023 Yuqiang Li, Shengchen Li, George Fazekas

Results display a general phenomenon of over-fitting from two aspects, the pitch embedding space and the test loss of the single-token grid encoding.

Feature Engineering Music Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.