Search Results for author: Hao-Wen Dong

Found 13 papers, 9 papers with code

Equipping Pretrained Unconditional Music Transformers with Instrument and Genre Controls

no code implementations • 21 Nov 2023 • Weihan Xu, Julian McAuley, Shlomo Dubnov, Hao-Wen Dong

We then propose a simple technique to equip this pretrained unconditional music transformer model with instrument and genre controls by finetuning the model with additional control tokens.

Music Generation

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models

no code implementations • 16 Jun 2023 • Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian McAuley

Our results show the effectiveness of the proposed method, and that the pretrained diffusion prior can reduce the modality transfer gap.

Audio Synthesis

CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos

1 code implementation • 14 Dec 2022 • Hao-Wen Dong, Naoya Takahashi, Yuki Mitsufuji, Julian McAuley, Taylor Berg-Kirkpatrick

Further, videos in the wild often contain off-screen sounds and background noise that may hinder the model from learning the desired audio-textual correspondence.

Multitrack Music Transformer

2 code implementations • 14 Jul 2022 • Hao-Wen Dong, Ke Chen, Shlomo Dubnov, Julian McAuley, Taylor Berg-Kirkpatrick

Existing approaches for generating multitrack music with transformer models have been limited in the number of instruments and the length of the music segments they support, and they suffer from slow inference.

Music Generation

125

Deep Performer: Score-to-Audio Music Performance Synthesis

no code implementations • 12 Feb 2022 • Hao-Wen Dong, Cong Zhou, Taylor Berg-Kirkpatrick, Julian McAuley

Music performance synthesis aims to synthesize a musical score into a natural performance.

Speech Synthesis, Text-To-Speech Synthesis

An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition

1 code implementation • 3 Aug 2021 • Sachinda Edirisooriya, Hao-Wen Dong, Julian McAuley, Taylor Berg-Kirkpatrick

Monophonic and homophonic music can be described as homorhythmic, or having a single musical rhythm.

Binary Classification

Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music

1 code implementation • 13 Jul 2021 • Hao-Wen Dong, Chris Donahue, Taylor Berg-Kirkpatrick, Julian McAuley

In this paper, we aim to further extend this idea and examine the feasibility of automatic instrumentation -- dynamically assigning instruments to notes in solo music during performance.

Multi-class Classification

MusPy: A Toolkit for Symbolic Music Generation

2 code implementations • 5 Aug 2020 • Hao-Wen Dong, Ke Chen, Julian McAuley, Taylor Berg-Kirkpatrick

MusPy provides easy-to-use tools for essential components in a music generation system, including dataset management, data I/O, data preprocessing and model evaluation.

Management, Music Generation

407

Automatic Melody Harmonization with Triad Chords: A Comparative Study

no code implementations • 8 Jan 2020 • Yin-Cheng Yeh, Wen-Yi Hsiao, Satoru Fukayama, Tetsuro Kitahara, Benjamin Genchel, Hao-Min Liu, Hao-Wen Dong, Yi-An Chen, Terence Leong, Yi-Hsuan Yang

Several prior works have proposed various methods for the task of automatic melody harmonization, in which a model aims to generate a sequence of chords to serve as the harmonic accompaniment of a given multiple-bar melody sequence.

Template Matching

On Output Activation Functions for Adversarial Losses: A Theoretical Analysis via Variational Divergence Minimization and An Empirical Study on MNIST Classification

1 code implementation • 25 Jan 2019 • Hao-Wen Dong, Yi-Hsuan Yang

2) How do different combinations of output activation functions and regularization approaches perform empirically against one another?

Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation

1 code implementation • 10 Oct 2018 • Hao-Wen Dong, Yi-Hsuan Yang

We propose the BinaryGAN, a novel generative adversarial network (GAN) that uses binary neurons at the output layer of the generator.

Generative Adversarial Network

Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation

3 code implementations • 25 Apr 2018 • Hao-Wen Dong, Yi-Hsuan Yang

Experimental results show that using binary neurons instead of hard thresholding (HT) or Bernoulli sampling (BS) indeed leads to better results on a number of objective measures.

Music Generation

1,704

MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment

8 code implementations • 19 Sep 2017 • Hao-Wen Dong, Wen-Yi Hsiao, Li-Chia Yang, Yi-Hsuan Yang

The three models, which differ in the underlying assumptions and accordingly the network architectures, are referred to as the jamming model, the composer model and the hybrid model.

Music Generation

1,704

