1 code implementation • 12 Aug 2024 • Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer
This enables the comparison of surprisal across different musical content even if the musical events occur in irregular time intervals.
1 code implementation • 12 Aug 2024 • Marco Pasini, Stefan Lattner, George Fazekas
We demonstrate that Music2Latent outperforms existing continuous audio autoencoders in sound quality and reconstruction accuracy while achieving competitive performance on downstream MIR tasks using its latent representations.
1 code implementation • 5 Aug 2024 • Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Michael Anslow, Geoffroy Peeters
This paper explores the automated process of determining stem compatibility by identifying audio recordings of single instruments that blend well with a given musical context.
no code implementations • 12 Jun 2024 • Javier Nistal, Marco Pasini, Cyran Aouameur, Maarten Grachten, Stefan Lattner
Recent advancements in deep generative models present new opportunities for music production but also pose challenges, such as high computational demands and limited audio quality.
1 code implementation • 14 May 2024 • Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters
This paper addresses the problem of self-supervised general-purpose audio representation learning.
no code implementations • 2 Feb 2024 • Marco Pasini, Maarten Grachten, Stefan Lattner
At the core of our method are audio autoencoders that efficiently compress audio waveform samples into invertible latent representations, and a conditional latent diffusion model that takes as input the latent encoding of a mix and generates the latent encoding of a corresponding stem.
1 code implementation • International Society of Music Information Retrieval 2023 • Bernardo Torres, Stefan Lattner, Gaël Richard
Significant strides have been made in creating voice identity representations using speech data.
no code implementations • 5 Sep 2023 • Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters
In this paper, we address the problem of pitch estimation using Self Supervised Learning (SSL).
no code implementations • 18 Aug 2023 • Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer
To accomplish this, we train a high-capacity transformer model on a vast collection of highly-structured Irish folk melodies and analyze the musical qualities of the samples generated using distribution truncation sampling techniques.
no code implementations • 23 Nov 2022 • Mathias Rose Bjare, Stefan Lattner
It has been shown in a recent publication that words in human-produced English language tend to have an information content close to the conditional entropy.
no code implementations • 1 Aug 2022 • Stefan Lattner
Artists must use their aesthetic judgement to assess whether a given drum sample fits the current musical context.
no code implementations • 29 Jun 2022 • Javier Nistal, Cyran Aouameur, Ithan Velarde, Stefan Lattner
In contemporary popular music production, drum sound design is commonly performed by cumbersome browsing and processing of pre-recorded samples in sound libraries.
1 code implementation • 4 May 2021 • Javier Nistal, Cyran Aouameur, Stefan Lattner, Gaël Richard
Influenced by the field of Computer Vision, Generative Adversarial Networks (GANs) are often adopted for the audio domain using fixed-size two-dimensional spectrogram representations as the "image data".
no code implementations • 6 Jan 2020 • Stefan Lattner
Furthermore, I motivate the relevance of musical transformations in structure modeling and show how a connectionist model, the Gated Autoencoder (GAE), can be employed to learn transformations between musical fragments.
no code implementations • 2 Aug 2019 • Stefan Lattner, Maarten Grachten
Here we propose a model for conditional kick drum track generation that takes existing musical material as input, in addition to a low-dimensional code that encodes the desired relation between the existing material and the new material to be generated.
1 code implementation • 13 Jul 2019 • Stefan Lattner, Monika Dörfler, Andreas Arzt
Mapping signals onto complex basis functions learned by the CAE results in a transformation-invariant "magnitude space" and a transformation-variant "phase space".
no code implementations • 22 Jun 2018 • Stefan Lattner, Maarten Grachten, Gerhard Widmer
We show that the RGAE improves the state of the art for general connectionist sequence models in learning to predict monophonic melodies, and that ensembles of relative and absolute music processing models improve the results appreciably.
no code implementations • 21 Jun 2018 • Stefan Lattner, Maarten Grachten, Gerhard Widmer
Many music theoretical constructs (such as scale types, modes, cadences, and chord types) are defined in terms of pitch intervals---relative distances between pitches.
no code implementations • 17 Aug 2017 • Stefan Lattner, Maarten Grachten, Gerhard Widmer
Music is usually highly structured and it is still an open question how to design models which can successfully learn to recognize and represent musical structure.
no code implementations • 5 Jul 2017 • Stefan Lattner, Maarten Grachten
Content-invariance in mapping codes learned by GAEs is a useful feature for various relation learning tasks.
no code implementations • 14 Dec 2016 • Stefan Lattner, Maarten Grachten, Gerhard Widmer
We introduce a method for imposing higher-level structure on generated, polyphonic music.