1 code implementation • 18 Mar 2022 • Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert
Despite its simplicity, benchmark results show our system's note estimation to be substantially better than a comparable baseline, and its frame-level accuracy to be only marginally below those of specialized state-of-the-art AMT systems.
Ranked #3 on Music Transcription on Slakh2100
9 code implementations • 8 Jun 2018 • Daniel Stoller, Sebastian Ewert, Simon Dixon
Models for audio source separation usually operate on the magnitude spectrum, which ignores phase information and makes separation performance dependant on hyper-parameters for the spectral front-end.
Ranked #27 on Music Source Separation on MUSDB18
3 code implementations • 31 Oct 2017 • Daniel Stoller, Sebastian Ewert, Simon Dixon
Based on this idea, we drive the separator towards outputs deemed as realistic by discriminator networks that are trained to tell apart real from separator samples.
1 code implementation • 14 Nov 2019 • Daniel Stoller, Mi Tian, Sebastian Ewert, Simon Dixon
In comparison to TCN and Wavenet, our network consistently saves memory and computation time, with speed-ups for training and inference of over 4x in the audio generation experiment in particular, while achieving a comparable performance in all tasks.
Ranked #2 on Music Modeling on Nottingham
2 code implementations • 18 Feb 2019 • Daniel Stoller, Simon Durand, Sebastian Ewert
Time-aligned lyrics can enrich the music listening experience by enabling karaoke, text-based song retrieval and intra-song navigation, and other applications.
1 code implementation • 13 Jun 2023 • Simon Durand, Daniel Stoller, Sebastian Ewert
This way, we obtain a novel system that is simple to train end-to-end, can make use of weakly annotated training data, jointly learns a powerful text model, and is tailored to alignment.
1 code implementation • ICLR 2020 • Daniel Stoller, Sebastian Ewert, Simon Dixon
We apply our method to image generation, image segmentation and audio source separation, and obtain improved performance over a standard GAN when additional incomplete training examples are available.
1 code implementation • 12 May 2022 • Yin-Jyun Luo, Sebastian Ewert, Simon Dixon
In this paper, we show that the vanilla DSAE suffers from being sensitive to the choice of model architecture and capacity of the dynamic latent variables, and is prone to collapse the static latent variable.
no code implementations • 5 Apr 2018 • Daniel Stoller, Sebastian Ewert, Simon Dixon
A main challenge in applying deep learning to music processing is the availability of training data.
no code implementations • 1 Jul 2017 • Sebastian Ewert, Mark B. Sandler
A central goal in automatic music transcription is to detect individual note events in music recordings.
no code implementations • 15 Sep 2016 • Sebastian Ewert, Mark B. Sandler
Many success stories involving deep neural networks are instances of supervised learning, where available labels power gradient-based learning methods.