Search Results for author: Alexander Lerch

Found 29 papers, 16 papers with code

Embedding Compression for Teacher-to-Student Knowledge Transfer

no code implementations9 Feb 2024 Yiwei Ding, Alexander Lerch

Common knowledge distillation methods require the teacher model and the student model to be trained on the same task.

Knowledge Distillation Transfer Learning

ASPED: An Audio Dataset for Detecting Pedestrians

no code implementations12 Sep 2023 Pavan Seshadri, Chaeyeon Han, Bon-Woo Koo, Noah Posner, Subhrajit Guhathakurta, Alexander Lerch

We introduce the new audio analysis task of pedestrian detection and present a new large-scale dataset for this task.

Pedestrian Detection

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation

1 code implementation5 Sep 2023 Karn N. Watcharasupat, Chih-Wei Wu, Yiwei Ding, Iroro Orife, Aaron J. Hipple, Phillip A. Williams, Scott Kramer, Alexander Lerch, William Wolcott

Cinematic audio source separation is a relatively new subtask of audio source separation, with the aim of extracting the dialogue, music, and effects stems from their mixture.

Audio Source Separation

Audio Embeddings as Teachers for Music Classification

1 code implementation30 Jun 2023 Yiwei Ding, Alexander Lerch

Music classification has been one of the most popular tasks in the field of music information retrieval.

Classification Information Retrieval +7

Quantifying Spatial Audio Quality Impairment

1 code implementation13 Jun 2023 Karn N. Watcharasupat, Alexander Lerch

Spatial audio quality is a highly multifaceted concept, with many interactions between environmental, geometrical, anatomical, psychological, and contextual considerations.

Audio Compression Music Source Separation

Music Instrument Classification Reprogrammed

1 code implementation15 Nov 2022 Hsin-Hung Chen, Alexander Lerch

The performance of approaches to Music Instrument Classification, a popular task in Music Information Retrieval, is often impacted and limited by the lack of availability of annotated data for training.

Classification Information Retrieval +2

Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming

1 code implementation2 Nov 2022 Yun-Ning Hung, Chao-Han Huck Yang, Pin-Yu Chen, Alexander Lerch

In this work, we introduce a novel method for leveraging pre-trained models for low-resource (music) classification based on the concept of Neural Model Reprogramming (NMR).

Classification Genre classification +3

Evaluating generative audio systems and their metrics

no code implementations31 Aug 2022 Ashvala Vinay, Alexander Lerch

Recent years have seen considerable advances in audio synthesis with deep generative models.

Audio Synthesis

Representation Learning for the Automatic Indexing of Sound Effects Libraries

1 code implementation18 Aug 2022 Alison B. Ma, Alexander Lerch

Labeling and maintaining a commercial sound effects library is a time-consuming task exacerbated by databases that continually grow in size and undergo taxonomy updates.

Metric Learning Representation Learning

Feature-informed Embedding Space Regularization For Audio Classification

no code implementations10 Jun 2022 Yun-Ning Hung, Alexander Lerch

The workload is kept low during inference as the pre-trained features are only necessary for training.

Audio Classification

Scream Detection in Heavy Metal Music

no code implementations11 May 2022 Vedant Kalbag, Alexander Lerch

Harsh vocal effects such as screams or growls are far more common in heavy metal vocals than the traditionally sung vocal.

Classification

Feature-informed Latent Space Regularization for Music Source Separation

no code implementations17 Mar 2022 Yun-Ning Hung, Alexander Lerch

The integration of additional side information to improve music source separation has been investigated numerous times, e. g., by adding features to the input or by adding learning targets in a multi-task learning scenario.

Multi-Task Learning Music Source Separation

Latte: Cross-framework Python Package for Evaluation of Latent-Based Generative Models

1 code implementation20 Dec 2021 Karn N. Watcharasupat, Junyoung Lee, Alexander Lerch

Latte (for LATent Tensor Evaluation) is a Python library for evaluation of latent-based generative models in the fields of disentanglement learning and controllable generation.

Disentanglement

Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes

1 code implementation11 Oct 2021 Karn N. Watcharasupat, Alexander Lerch

Controllable music generation with deep generative models has become increasingly reliant on disentanglement learning techniques.

Disentanglement Music Generation

Improving Music Performance Assessment with Contrastive Learning

1 code implementation3 Aug 2021 Pavan Seshadri, Alexander Lerch

Several automatic approaches for objective music performance assessment (MPA) have been proposed in the past, however, existing systems are not yet capable of reliably predicting ratings with the same accuracy as professional judges.

Contrastive Learning regression +1

Is Disentanglement enough? On Latent Representations for Controllable Music Generation

1 code implementation1 Aug 2021 Ashis Pati, Alexander Lerch

The structure of the latent space with respect to the VAE-decoder plays an important role in boosting the ability of a generative model to manipulate different attributes.

Disentanglement Music Generation

Mind the beat: detecting audio onsets from EEG recordings of music listening

no code implementations12 Feb 2021 Ashvala Vinay, Alexander Lerch, Grace Leslie

We propose a deep learning approach to predicting audio event onsets in electroencephalogram (EEG) recorded from users as they listen to music.

EEG

Audio Content Analysis

no code implementations1 Jan 2021 Alexander Lerch

With a focus on Music Information Retrieval systems, this chapter defines musical audio content, introduces the general process of audio content analysis, and surveys basic approaches to audio content analysis.

Emotion Recognition General Classification +9

Melody-Conditioned Lyrics Generation with SeqGANs

no code implementations28 Oct 2020 Yihao Chen, Alexander Lerch

Automatic lyrics generation has received attention from both music and AI communities for years.

Multitask learning for instrument activation aware music source separation

no code implementations3 Aug 2020 Yun-Ning Hung, Alexander Lerch

Music source separation is a core task in music information retrieval which has seen a dramatic improvement in the past years.

Information Retrieval Music Information Retrieval +2

Score-informed Networks for Music Performance Assessment

1 code implementation1 Aug 2020 Jiawen Huang, Yun-Ning Hung, Ashis Pati, Siddharth Kumar Gururani, Alexander Lerch

The assessment of music performances in most cases takes into account the underlying musical score being performed.

Time Series Time Series Analysis

dMelodies: A Music Dataset for Disentanglement Learning

2 code implementations29 Jul 2020 Ashis Pati, Siddharth Gururani, Alexander Lerch

In this paper, we present a new symbolic music dataset that will help researchers working on disentanglement problems demonstrate the efficacy of their algorithms on diverse domains.

Benchmarking Disentanglement

Visual Attention for Musical Instrument Recognition

no code implementations17 Jun 2020 Karn Watcharasupat, Siddharth Gururani, Alexander Lerch

In the field of music information retrieval, the task of simultaneously identifying the presence or absence of multiple musical instruments in a polyphonic recording remains a hard problem.

Information Retrieval Instrument Recognition +2

Attribute-based Regularization of Latent Spaces for Variational Auto-Encoders

1 code implementation11 Apr 2020 Ashis Pati, Alexander Lerch

Selective manipulation of data attributes using deep generative models is an active area of research.

Attribute

Explicitly Conditioned Melody Generation: A Case Study with Interdependent RNNs

no code implementations10 Jul 2019 Benjamin Genchel, Ashis Pati, Alexander Lerch

In this study, we investigate the effects of explicitly conditioning deep generative models with musically relevant information.

An Attention Mechanism for Musical Instrument Recognition

1 code implementation9 Jul 2019 Siddharth Gururani, Mohit Sharma, Alexander Lerch

While the automatic recognition of musical instruments has seen significant progress, the task is still considered hard for music featuring multiple instruments as opposed to single instrument recordings.

Instrument Recognition

Learning to Traverse Latent Spaces for Musical Score Inpainting

1 code implementation2 Jul 2019 Ashis Pati, Alexander Lerch, Gaëtan Hadjeres

The designed model takes both past and future musical context into account and is capable of suggesting ways to connect them in a musically meaningful manner.

Learning to Fuse Music Genres with Generative Adversarial Dual Learning

1 code implementation5 Dec 2017 Zhiqian Chen, Chih-Wei Wu, Yen-Cheng Lu, Alexander Lerch, Chang-Tien Lu

FusionGAN is a novel genre fusion framework for music generation that integrates the strengths of generative adversarial networks and dual learning.

Music Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.