Search Results for author: Fabian-Robert Stöter

Found 9 papers, 7 papers with code

Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark

1 code implementation23 Nov 2023 Ondřej Cífka, Constantinos Dimitriou, Cheng-i Wang, Hendrik Schreiber, Luke Miner, Fabian-Robert Stöter

Current automatic lyrics transcription (ALT) benchmarks focus exclusively on word content and ignore the finer nuances of written lyrics including formatting and punctuation, which leads to a potential misalignment with the creative products of musicians and songwriters as well as listeners' experiences.

Automatic Lyrics Transcription

Music Demixing Challenge 2021

1 code implementation31 Aug 2021 Yuki Mitsufuji, Giorgio Fabbro, Stefan Uhlich, Fabian-Robert Stöter, Alexandre Défossez, Minseok Kim, Woosung Choi, Chin-Yun Yu, Kin-Wai Cheuk

The main differences compared with the past challenges are 1) the competition is designed to more easily allow machine learning practitioners from other disciplines to participate, 2) evaluation is done on a hidden test set created by music professionals dedicated exclusively to the challenge to assure the transparency of the challenge, i. e., the test set is not accessible from anyone except the challenge organizers, and 3) the dataset provides a wider range of music genres and involved a greater number of mixing engineers.

Music Source Separation

Open-Unmix - A Reference Implementation for Music Source Separation

1 code implementation The Journal of Open Source Software 2019 Fabian-Robert Stöter, Stefan Uhlich, Antoine Liutkus, and YukiMitsufuji

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

Ranked #15 on Music Source Separation on MUSDB18 (using extra training data)

Music Source Separation

CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning Speaker Count Estimation

1 code implementation IEEE/ACM Transactions on Audio, Speech, and Language Processing 2018 Fabian-Robert Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuël Habets

Estimating the maximum number of concurrent speakers from single-channel mixtures is a challenging problem and an essential first step to address various audio-based tasks such as blind source separation, speaker diarization, and audio surveillance.

blind source separation speaker-diarization +1

Sliced-Wasserstein Flows: Nonparametric Generative Modeling via Optimal Transport and Diffusions

1 code implementation21 Jun 2018 Antoine Liutkus, Umut Şimşekli, Szymon Majewski, Alain Durmus, Fabian-Robert Stöter

To the best of our knowledge, the proposed algorithm is the first nonparametric IGM algorithm with explicit theoretical guarantees.

An Overview of Lead and Accompaniment Separation in Music

no code implementations23 Apr 2018 Zafar Rafii, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry FitzGerald, Bryan Pardo

For model-based methods, we organize them according to whether they concentrate on the lead signal, the accompaniment, or both.

Sound Audio and Speech Processing

The 2018 Signal Separation Evaluation Campaign

no code implementations17 Apr 2018 Fabian-Robert Stöter, Antoine Liutkus, Nobutaka Ito

This paper reports the organization and results for the 2018 community-based Signal Separation Evaluation Campaign (SiSEC 2018).

Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation

1 code implementation12 Dec 2017 Fabian-Robert Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuël. A. P. Habets

The task of estimating the maximum number of concurrent speakers from single channel mixtures is important for various audio-based applications, such as blind source separation, speaker diarisation, audio surveillance or auditory scene classification.

Audio and Speech Processing Sound

Cannot find the paper you are looking for? You can Submit a new open access paper.