Search Results for author: Fabian-Robert Stöter

Found 9 papers, 7 papers with code

Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark

1 code implementation • 23 Nov 2023 • Ondřej Cífka, Constantinos Dimitriou, Cheng-i Wang, Hendrik Schreiber, Luke Miner, Fabian-Robert Stöter

Current automatic lyrics transcription (ALT) benchmarks focus exclusively on word content and ignore the finer nuances of written lyrics including formatting and punctuation, which leads to a potential misalignment with the creative products of musicians and songwriters as well as listeners' experiences.

Ranked #1 on Automatic Lyrics Transcription on Jam-ALT Spanish

Automatic Lyrics Transcription

Paper
Code

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

2 code implementations • 14 Aug 2023 • Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco Martínez-Ramírez, WeiHsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada Mohanty, Roman Solovyev, Alexander Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, Jiafeng Liu, Yuki Mitsufuji

We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce two new datasets that simulate such errors: SDXDB23_LabelNoise and SDXDB23_Bleeding.

Music Source Separation

486

Paper
Code

Music Demixing Challenge 2021

1 code implementation • 31 Aug 2021 • Yuki Mitsufuji, Giorgio Fabbro, Stefan Uhlich, Fabian-Robert Stöter, Alexandre Défossez, Minseok Kim, Woosung Choi, Chin-Yun Yu, Kin-Wai Cheuk

The main differences compared with the past challenges are 1) the competition is designed to more easily allow machine learning practitioners from other disciplines to participate, 2) evaluation is done on a hidden test set created by music professionals dedicated exclusively to the challenge to assure the transparency of the challenge, i. e., the test set is not accessible from anyone except the challenge organizers, and 3) the dataset provides a wider range of music genres and involved a greater number of mixing engineers.

Music Source Separation

Paper
Code

Open-Unmix - A Reference Implementation for Music Source Separation

1 code implementation • The Journal of Open Source Software 2019 • Fabian-Robert Stöter, Stefan Uhlich, Antoine Liutkus, and YukiMitsufuji

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

Ranked #15 on Music Source Separation on MUSDB18 (using extra training data)

Music Source Separation

1,158

Paper
Code

CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning Speaker Count Estimation

1 code implementation • IEEE/ACM Transactions on Audio, Speech, and Language Processing 2018 • Fabian-Robert Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuël Habets

Estimating the maximum number of concurrent speakers from single-channel mixtures is a challenging problem and an essential first step to address various audio-based tasks such as blind source separation, speaker diarization, and audio surveillance.

blind source separation speaker-diarization +1

141

Paper
Code

Sliced-Wasserstein Flows: Nonparametric Generative Modeling via Optimal Transport and Diffusions

1 code implementation • 21 Jun 2018 • Antoine Liutkus, Umut Şimşekli, Szymon Majewski, Alain Durmus, Fabian-Robert Stöter

To the best of our knowledge, the proposed algorithm is the first nonparametric IGM algorithm with explicit theoretical guarantees.

Paper
Code

An Overview of Lead and Accompaniment Separation in Music

no code implementations • 23 Apr 2018 • Zafar Rafii, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry FitzGerald, Bryan Pardo

For model-based methods, we organize them according to whether they concentrate on the lead signal, the accompaniment, or both.

Sound Audio and Speech Processing

Paper
Add Code

The 2018 Signal Separation Evaluation Campaign

no code implementations • 17 Apr 2018 • Fabian-Robert Stöter, Antoine Liutkus, Nobutaka Ito

This paper reports the organization and results for the 2018 community-based Signal Separation Evaluation Campaign (SiSEC 2018).

Paper
Add Code

Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation

1 code implementation • 12 Dec 2017 • Fabian-Robert Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuël. A. P. Habets

The task of estimating the maximum number of concurrent speakers from single channel mixtures is important for various audio-based applications, such as blind source separation, speaker diarisation, audio surveillance or auditory scene classification.

Audio and Speech Processing Sound

141

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.