Search Results for author: Sebastian Ewert

Found 11 papers, 8 papers with code

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

1 code implementation • 18 Mar 2022 • Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert

Despite its simplicity, benchmark results show our system's note estimation to be substantially better than a comparable baseline, and its frame-level accuracy to be only marginally below those of specialized state-of-the-art AMT systems.

Ranked #3 on Music Transcription on Slakh2100

Music Transcription

2,918

Paper
Code

Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation

9 code implementations • 8 Jun 2018 • Daniel Stoller, Sebastian Ewert, Simon Dixon

Models for audio source separation usually operate on the magnitude spectrum, which ignores phase information and makes separation performance dependant on hyper-parameters for the spectral front-end.

Ranked #27 on Music Source Separation on MUSDB18

Audio Source Separation Music Source Separation

791

Paper
Code

Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction

3 code implementations • 31 Oct 2017 • Daniel Stoller, Sebastian Ewert, Simon Dixon

Based on this idea, we drive the separator towards outputs deemed as realistic by discriminator networks that are trained to tell apart real from separator samples.

Audio Source Separation Data Augmentation +1

Paper
Code

Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling

1 code implementation • 14 Nov 2019 • Daniel Stoller, Mi Tian, Sebastian Ewert, Simon Dixon

In comparison to TCN and Wavenet, our network consistently saves memory and computation time, with speed-ups for training and inference of over 4x in the audio generation experiment in particular, while achieving a comparable performance in all tasks.

Ranked #2 on Music Modeling on Nottingham

Audio Generation Causal Language Modeling +2

Paper
Code

End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition Model

2 code implementations • 18 Feb 2019 • Daniel Stoller, Simon Durand, Sebastian Ewert

Time-aligned lyrics can enrich the music listening experience by enabling karaoke, text-based song retrieval and intra-song navigation, and other applications.

Retrieval

Paper
Code

Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages

1 code implementation • 13 Jun 2023 • Simon Durand, Daniel Stoller, Sebastian Ewert

This way, we obtain a novel system that is simple to train end-to-end, can make use of weakly annotated training data, jointly learns a powerful text model, and is tailored to alignment.

Contrastive Learning speech-recognition +1

Paper
Code

Training Generative Adversarial Networks from Incomplete Observations using Factorised Discriminators

1 code implementation • ICLR 2020 • Daniel Stoller, Sebastian Ewert, Simon Dixon

We apply our method to image generation, image segmentation and audio source separation, and obtain improved performance over a standard GAN when additional incomplete training examples are available.

Audio Source Separation Image Generation +3

Paper
Code

Towards Robust Unsupervised Disentanglement of Sequential Data -- A Case Study Using Music Audio

1 code implementation • 12 May 2022 • Yin-Jyun Luo, Sebastian Ewert, Simon Dixon

In this paper, we show that the vanilla DSAE suffers from being sensitive to the choice of model architecture and capacity of the dynamic latent variables, and is prone to collapse the static latent variable.

Data Augmentation Disentanglement +1

Paper
Code

Jointly Detecting and Separating Singing Voice: A Multi-Task Approach

no code implementations • 5 Apr 2018 • Daniel Stoller, Sebastian Ewert, Simon Dixon

A main challenge in applying deep learning to music processing is the availability of training data.

Action Detection Activity Detection +1

Paper
Add Code

An Augmented Lagrangian Method for Piano Transcription using Equal Loudness Thresholding and LSTM-based Decoding

no code implementations • 1 Jul 2017 • Sebastian Ewert, Mark B. Sandler

A central goal in automatic music transcription is to detect individual note events in music recordings.

Music Transcription

Paper
Add Code

Structured Dropout for Weak Label and Multi-Instance Learning and Its Application to Score-Informed Source Separation

no code implementations • 15 Sep 2016 • Sebastian Ewert, Mark B. Sandler

Many success stories involving deep neural networks are instances of supervised learning, where available labels power gradient-based learning methods.

Representation Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.