no code implementations • 21 Nov 2023 • Weihan Xu, Julian McAuley, Shlomo Dubnov, Hao-Wen Dong
We then propose a simple technique to equip this pretrained unconditional music transformer model with instrument and genre controls by finetuning the model with additional control tokens.
no code implementations • 16 Jun 2023 • Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian McAuley
Our results show the effectiveness of the proposed method, and that the pretrained diffusion prior can reduce the modality transfer gap.
1 code implementation • 14 Dec 2022 • Hao-Wen Dong, Naoya Takahashi, Yuki Mitsufuji, Julian McAuley, Taylor Berg-Kirkpatrick
Further, videos in the wild often contain off-screen sounds and background noise that may hinder the model from learning the desired audio-textual correspondence.
2 code implementations • 14 Jul 2022 • Hao-Wen Dong, Ke Chen, Shlomo Dubnov, Julian McAuley, Taylor Berg-Kirkpatrick
Existing approaches for generating multitrack music with transformer models have been limited in the number of instruments and the length of the music segments, and they suffer from slow inference.
no code implementations • 12 Feb 2022 • Hao-Wen Dong, Cong Zhou, Taylor Berg-Kirkpatrick, Julian McAuley
Music performance synthesis aims to synthesize a musical score into a natural performance.
1 code implementation • 3 Aug 2021 • Sachinda Edirisooriya, Hao-Wen Dong, Julian McAuley, Taylor Berg-Kirkpatrick
Monophonic and homophonic music can be described as homorhythmic, or having a single musical rhythm.
1 code implementation • 13 Jul 2021 • Hao-Wen Dong, Chris Donahue, Taylor Berg-Kirkpatrick, Julian McAuley
In this paper, we aim to further extend this idea and examine the feasibility of automatic instrumentation -- dynamically assigning instruments to notes in solo music during performance.
2 code implementations • 5 Aug 2020 • Hao-Wen Dong, Ke Chen, Julian McAuley, Taylor Berg-Kirkpatrick
MusPy provides easy-to-use tools for essential components in a music generation system, including dataset management, data I/O, data preprocessing and model evaluation.
no code implementations • 8 Jan 2020 • Yin-Cheng Yeh, Wen-Yi Hsiao, Satoru Fukayama, Tetsuro Kitahara, Benjamin Genchel, Hao-Min Liu, Hao-Wen Dong, Yi-An Chen, Terence Leong, Yi-Hsuan Yang
Several prior works have proposed various methods for the task of automatic melody harmonization, in which a model aims to generate a sequence of chords to serve as the harmonic accompaniment of a given multiple-bar melody sequence.
1 code implementation • 25 Jan 2019 • Hao-Wen Dong, Yi-Hsuan Yang
2) How do different combinations of output activation functions and regularization approaches perform empirically against one another?
1 code implementation • 10 Oct 2018 • Hao-Wen Dong, Yi-Hsuan Yang
We propose the BinaryGAN, a novel generative adversarial network (GAN) that uses binary neurons at the output layer of the generator.
3 code implementations • 25 Apr 2018 • Hao-Wen Dong, Yi-Hsuan Yang
Experimental results show that using binary neurons instead of hard thresholding (HT) or Bernoulli sampling (BS) indeed leads to better results on a number of objective measures.
8 code implementations • 19 Sep 2017 • Hao-Wen Dong, Wen-Yi Hsiao, Li-Chia Yang, Yi-Hsuan Yang
The three models, which differ in the underlying assumptions and accordingly the network architectures, are referred to as the jamming model, the composer model and the hybrid model.