no code implementations • 12 Sep 2024 • T. Aleksandra Ma, Alexander Lerch
We propose to integrate few-shot learning methodology into multi-label music auto-tagging by using features from pre-trained models as inputs to a lightweight linear classifier, also known as a linear probe.
1 code implementation • 26 Jun 2024 • Karn N. Watcharasupat, Alexander Lerch
Of the very few current systems that support source separation beyond this setup, most continue to rely on an inflexible decoder setup that can only support a fixed pre-defined set of stems.
no code implementations • 14 Jun 2024 • Chaeyeon Han, Pavan Seshadri, Yiwei Ding, Noah Posner, Bon Woo Koo, Animesh Agrawal, Alexander Lerch, Subhrajit Guhathakurta
This study discusses a new approach to scale up urban sensing of people with the help of novel audio-based technology.
no code implementations • 9 Feb 2024 • Yiwei Ding, Alexander Lerch
Common knowledge distillation methods require the teacher model and the student model to be trained on the same task.
no code implementations • 12 Sep 2023 • Pavan Seshadri, Chaeyeon Han, Bon-Woo Koo, Noah Posner, Subhrajit Guhathakurta, Alexander Lerch
We introduce the new audio analysis task of pedestrian detection and present a new large-scale dataset for this task.
1 code implementation • 5 Sep 2023 • Karn N. Watcharasupat, Chih-Wei Wu, Yiwei Ding, Iroro Orife, Aaron J. Hipple, Phillip A. Williams, Scott Kramer, Alexander Lerch, William Wolcott
Cinematic audio source separation is a relatively new subtask of audio source separation, with the aim of extracting the dialogue, music, and effects stems from their mixture.
1 code implementation • 30 Jun 2023 • Yiwei Ding, Alexander Lerch
Music classification has been one of the most popular tasks in the field of music information retrieval.
Ranked #2 on Instrument Recognition on OpenMIC-2018
1 code implementation • 13 Jun 2023 • Karn N. Watcharasupat, Alexander Lerch
Spatial audio quality is a highly multifaceted concept, with many interactions between environmental, geometrical, anatomical, psychological, and contextual considerations.
1 code implementation • 15 Nov 2022 • Hsin-Hung Chen, Alexander Lerch
The performance of approaches to Music Instrument Classification, a popular task in Music Information Retrieval, is often impacted and limited by the lack of availability of annotated data for training.
1 code implementation • 2 Nov 2022 • Yun-Ning Hung, Chao-Han Huck Yang, Pin-Yu Chen, Alexander Lerch
In this work, we introduce a novel method for leveraging pre-trained models for low-resource (music) classification based on the concept of Neural Model Reprogramming (NMR).
no code implementations • 31 Aug 2022 • Ashvala Vinay, Alexander Lerch
Recent years have seen considerable advances in audio synthesis with deep generative models.
1 code implementation • 18 Aug 2022 • Alison B. Ma, Alexander Lerch
Labeling and maintaining a commercial sound effects library is a time-consuming task exacerbated by databases that continually grow in size and undergo taxonomy updates.
no code implementations • 10 Jun 2022 • Yun-Ning Hung, Alexander Lerch
The workload is kept low during inference as the pre-trained features are only necessary for training.
no code implementations • 11 May 2022 • Vedant Kalbag, Alexander Lerch
Harsh vocal effects such as screams or growls are far more common in heavy metal vocals than the traditionally sung vocal.
no code implementations • 17 Mar 2022 • Yun-Ning Hung, Alexander Lerch
The integration of additional side information to improve music source separation has been investigated numerous times, e. g., by adding features to the input or by adding learning targets in a multi-task learning scenario.
1 code implementation • 20 Dec 2021 • Karn N. Watcharasupat, Junyoung Lee, Alexander Lerch
Latte (for LATent Tensor Evaluation) is a Python library for evaluation of latent-based generative models in the fields of disentanglement learning and controllable generation.
1 code implementation • 11 Oct 2021 • Karn N. Watcharasupat, Alexander Lerch
Controllable music generation with deep generative models has become increasingly reliant on disentanglement learning techniques.
1 code implementation • 3 Aug 2021 • Pavan Seshadri, Alexander Lerch
Several automatic approaches for objective music performance assessment (MPA) have been proposed in the past, however, existing systems are not yet capable of reliably predicting ratings with the same accuracy as professional judges.
1 code implementation • 1 Aug 2021 • Ashis Pati, Alexander Lerch
The structure of the latent space with respect to the VAE-decoder plays an important role in boosting the ability of a generative model to manipulate different attributes.
no code implementations • 12 Feb 2021 • Ashvala Vinay, Alexander Lerch, Grace Leslie
We propose a deep learning approach to predicting audio event onsets in electroencephalogram (EEG) recorded from users as they listen to music.
no code implementations • 1 Jan 2021 • Alexander Lerch
With a focus on Music Information Retrieval systems, this chapter defines musical audio content, introduces the general process of audio content analysis, and surveys basic approaches to audio content analysis.
no code implementations • 28 Oct 2020 • Yihao Chen, Alexander Lerch
Automatic lyrics generation has received attention from both music and AI communities for years.
no code implementations • 3 Aug 2020 • Yun-Ning Hung, Alexander Lerch
Music source separation is a core task in music information retrieval which has seen a dramatic improvement in the past years.
1 code implementation • 1 Aug 2020 • Jiawen Huang, Yun-Ning Hung, Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch
The assessment of music performances in most cases takes into account the underlying musical score being performed.
2 code implementations • 29 Jul 2020 • Ashis Pati, Siddharth Gururani, Alexander Lerch
In this paper, we present a new symbolic music dataset that will help researchers working on disentanglement problems demonstrate the efficacy of their algorithms on diverse domains.
no code implementations • 17 Jun 2020 • Karn Watcharasupat, Siddharth Gururani, Alexander Lerch
In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem.
1 code implementation • 11 Apr 2020 • Ashis Pati, Alexander Lerch
Selective manipulation of data attributes using deep generative models is an active area of research.
no code implementations • 10 Jul 2019 • Benjamin Genchel, Ashis Pati, Alexander Lerch
In this study, we investigate the effects of explicitly conditioning deep generative models with musically relevant information.
1 code implementation • 9 Jul 2019 • Siddharth Gururani, Mohit Sharma, Alexander Lerch
While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings.
1 code implementation • 2 Jul 2019 • Ashis Pati, Alexander Lerch, Gaëtan Hadjeres
The designed model takes both past and future musical context into account and is capable of suggesting ways to connect them in a musically meaningful manner.
no code implementations • 29 Jun 2019 • Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani
Music Information Retrieval (MIR) tends to focus on the analysis of audio signals.
1 code implementation • 5 Dec 2017 • Zhiqian Chen, Chih-Wei Wu, Yen-Cheng Lu, Alexander Lerch, Chang-Tien Lu
FusionGAN is a novel genre fusion framework for music generation that integrates the strengths of generative adversarial networks and dual learning.