Search Results for author: Alexander Lerch

Found 29 papers, 16 papers with code

Embedding Compression for Teacher-to-Student Knowledge Transfer

no code implementations • 9 Feb 2024 • Yiwei Ding, Alexander Lerch

Common knowledge distillation methods require the teacher model and the student model to be trained on the same task.

Knowledge Distillation Transfer Learning

Paper
Add Code

ASPED: An Audio Dataset for Detecting Pedestrians

no code implementations • 12 Sep 2023 • Pavan Seshadri, Chaeyeon Han, Bon-Woo Koo, Noah Posner, Subhrajit Guhathakurta, Alexander Lerch

We introduce the new audio analysis task of pedestrian detection and present a new large-scale dataset for this task.

Pedestrian Detection

Paper
Add Code

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation

1 code implementation • 5 Sep 2023 • Karn N. Watcharasupat, Chih-Wei Wu, Yiwei Ding, Iroro Orife, Aaron J. Hipple, Phillip A. Williams, Scott Kramer, Alexander Lerch, William Wolcott

Cinematic audio source separation is a relatively new subtask of audio source separation, with the aim of extracting the dialogue, music, and effects stems from their mixture.

Audio Source Separation

Paper
Code

Audio Embeddings as Teachers for Music Classification

1 code implementation • 30 Jun 2023 • Yiwei Ding, Alexander Lerch

Music classification has been one of the most popular tasks in the field of music information retrieval.

Ranked #2 on Instrument Recognition on OpenMIC-2018

Classification Information Retrieval +7

Paper
Code

Quantifying Spatial Audio Quality Impairment

1 code implementation • 13 Jun 2023 • Karn N. Watcharasupat, Alexander Lerch

Spatial audio quality is a highly multifaceted concept, with many interactions between environmental, geometrical, anatomical, psychological, and contextual considerations.

Audio Compression Music Source Separation

Paper
Code

Music Instrument Classification Reprogrammed

1 code implementation • 15 Nov 2022 • Hsin-Hung Chen, Alexander Lerch

The performance of approaches to Music Instrument Classification, a popular task in Music Information Retrieval, is often impacted and limited by the lack of availability of annotated data for training.

Classification Information Retrieval +2

Paper
Code

Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming

1 code implementation • 2 Nov 2022 • Yun-Ning Hung, Chao-Han Huck Yang, Pin-Yu Chen, Alexander Lerch

In this work, we introduce a novel method for leveraging pre-trained models for low-resource (music) classification based on the concept of Neural Model Reprogramming (NMR).

Classification Genre classification +3

Paper
Code

Evaluating generative audio systems and their metrics

no code implementations • 31 Aug 2022 • Ashvala Vinay, Alexander Lerch

Recent years have seen considerable advances in audio synthesis with deep generative models.

Audio Synthesis

Paper
Add Code

Representation Learning for the Automatic Indexing of Sound Effects Libraries

1 code implementation • 18 Aug 2022 • Alison B. Ma, Alexander Lerch

Labeling and maintaining a commercial sound effects library is a time-consuming task exacerbated by databases that continually grow in size and undergo taxonomy updates.

Metric Learning Representation Learning

Paper
Code

Feature-informed Embedding Space Regularization For Audio Classification

no code implementations • 10 Jun 2022 • Yun-Ning Hung, Alexander Lerch

The workload is kept low during inference as the pre-trained features are only necessary for training.

Audio Classification

Paper
Add Code

Scream Detection in Heavy Metal Music

no code implementations • 11 May 2022 • Vedant Kalbag, Alexander Lerch

Harsh vocal effects such as screams or growls are far more common in heavy metal vocals than the traditionally sung vocal.

Classification

Paper
Add Code

Feature-informed Latent Space Regularization for Music Source Separation

no code implementations • 17 Mar 2022 • Yun-Ning Hung, Alexander Lerch

The integration of additional side information to improve music source separation has been investigated numerous times, e. g., by adding features to the input or by adding learning targets in a multi-task learning scenario.

Multi-Task Learning Music Source Separation

Paper
Add Code

Latte: Cross-framework Python Package for Evaluation of Latent-Based Generative Models

1 code implementation • 20 Dec 2021 • Karn N. Watcharasupat, Junyoung Lee, Alexander Lerch

Latte (for LATent Tensor Evaluation) is a Python library for evaluation of latent-based generative models in the fields of disentanglement learning and controllable generation.

Disentanglement

Paper
Code

Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes

1 code implementation • 11 Oct 2021 • Karn N. Watcharasupat, Alexander Lerch

Controllable music generation with deep generative models has become increasingly reliant on disentanglement learning techniques.

Disentanglement Music Generation

Paper
Code

Improving Music Performance Assessment with Contrastive Learning

1 code implementation • 3 Aug 2021 • Pavan Seshadri, Alexander Lerch

Several automatic approaches for objective music performance assessment (MPA) have been proposed in the past, however, existing systems are not yet capable of reliably predicting ratings with the same accuracy as professional judges.

Contrastive Learning regression +1

Paper
Code

Is Disentanglement enough? On Latent Representations for Controllable Music Generation

1 code implementation • 1 Aug 2021 • Ashis Pati, Alexander Lerch

The structure of the latent space with respect to the VAE-decoder plays an important role in boosting the ability of a generative model to manipulate different attributes.

Disentanglement Music Generation

Paper
Code

Mind the beat: detecting audio onsets from EEG recordings of music listening

no code implementations • 12 Feb 2021 • Ashvala Vinay, Alexander Lerch, Grace Leslie

We propose a deep learning approach to predicting audio event onsets in electroencephalogram (EEG) recorded from users as they listen to music.

EEG

Paper
Add Code

Audio Content Analysis

no code implementations • 1 Jan 2021 • Alexander Lerch

With a focus on Music Information Retrieval systems, this chapter defines musical audio content, introduces the general process of audio content analysis, and surveys basic approaches to audio content analysis.

Emotion Recognition General Classification +9

Paper
Add Code

Melody-Conditioned Lyrics Generation with SeqGANs

no code implementations • 28 Oct 2020 • Yihao Chen, Alexander Lerch

Automatic lyrics generation has received attention from both music and AI communities for years.

Paper
Add Code

Multitask learning for instrument activation aware music source separation

no code implementations • 3 Aug 2020 • Yun-Ning Hung, Alexander Lerch

Music source separation is a core task in music information retrieval which has seen a dramatic improvement in the past years.

Information Retrieval Music Information Retrieval +2

Paper
Add Code

Score-informed Networks for Music Performance Assessment

1 code implementation • 1 Aug 2020 • Jiawen Huang, Yun-Ning Hung, Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch

The assessment of music performances in most cases takes into account the underlying musical score being performed.

Time Series Time Series Analysis

Paper
Code

dMelodies: A Music Dataset for Disentanglement Learning

2 code implementations • 29 Jul 2020 • Ashis Pati, Siddharth Gururani, Alexander Lerch

In this paper, we present a new symbolic music dataset that will help researchers working on disentanglement problems demonstrate the efficacy of their algorithms on diverse domains.

Benchmarking Disentanglement

Paper
Code

Visual Attention for Musical Instrument Recognition

no code implementations • 17 Jun 2020 • Karn Watcharasupat, Siddharth Gururani, Alexander Lerch

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem.

Information Retrieval Instrument Recognition +2

Paper
Add Code

Attribute-based Regularization of Latent Spaces for Variational Auto-Encoders

1 code implementation • 11 Apr 2020 • Ashis Pati, Alexander Lerch

Selective manipulation of data attributes using deep generative models is an active area of research.

Attribute

Paper
Code

Explicitly Conditioned Melody Generation: A Case Study with Interdependent RNNs

no code implementations • 10 Jul 2019 • Benjamin Genchel, Ashis Pati, Alexander Lerch

In this study, we investigate the effects of explicitly conditioning deep generative models with musically relevant information.

Paper
Add Code

An Attention Mechanism for Musical Instrument Recognition

1 code implementation • 9 Jul 2019 • Siddharth Gururani, Mohit Sharma, Alexander Lerch

While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings.

Instrument Recognition

Paper
Code

Learning to Traverse Latent Spaces for Musical Score Inpainting

1 code implementation • 2 Jul 2019 • Ashis Pati, Alexander Lerch, Gaëtan Hadjeres

The designed model takes both past and future musical context into account and is capable of suggesting ways to connect them in a musically meaningful manner.

Paper
Code

Music Performance Analysis: A Survey

no code implementations • 29 Jun 2019 • Alexander Lerch, Claire Arthur, Ashis Pati, Siddharth Gururani

Music Information Retrieval (MIR) tends to focus on the analysis of audio signals.

Information Retrieval Music Information Retrieval +1

Paper
Add Code

Learning to Fuse Music Genres with Generative Adversarial Dual Learning

1 code implementation • 5 Dec 2017 • Zhiqian Chen, Chih-Wei Wu, Yen-Cheng Lu, Alexander Lerch, Chang-Tien Lu

FusionGAN is a novel genre fusion framework for music generation that integrates the strengths of generative adversarial networks and dual learning.

Music Generation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.