1 code implementation • 12 Oct 2022 • Yueh-Kao Wu, Ching-Yu Chiu, Yi-Hsuan Yang
Instead of generating the drum track directly as waveforms, we use a separate VQ-VAE to encode the mel-spectrogram of a drum track into another set of discrete codes, and train the Transformer to predict the sequence of drum-related discrete codes.
1 code implementation • 26 Aug 2020 • Antonio Ramires, Frederic Font, Dmitry Bogdanov, Jordan B. L. Smith, Yi-Hsuan Yang, Joann Ching, Bo-Yu Chen, Yueh-Kao Wu, Hsu Wei-Han, Xavier Serra
We present the Freesound Loop Dataset (FSLD), a new large-scale dataset of music loops annotated by experts.
Audio and Speech Processing Sound