Search Results for author: Bryan Pardo

Despite phenomenal progress in recent years, state-of-the-art music separation systems produce source estimates with significant perceptual shortcomings, such as adding extraneous noise or removing harmonics.

Music Source Separation

Paper
Add Code

Reproducible Subjective Evaluation

1 code implementation • 8 Mar 2022 • Max Morrison, Brian Tang, Gefei Tan, Bryan Pardo

ReSEval lets researchers launch A/B, ABX, Mean Opinion Score (MOS) and MUltiple Stimuli with Hidden Reference and Anchor (MUSHRA) tests on audio, image, text, or video data from a command-line interface or using one line of Python, making it as easy to run as objective evaluation.

Paper
Code

Deep Learning Tools for Audacity: Helping Researchers Expand the Artist's Toolkit

no code implementations • 25 Oct 2021 • Hugo Flores Garcia, Aldo Aguilar, Ethan Manilow, Dmitry Vedenko, Bryan Pardo

We present a software framework that integrates neural networks into the popular open-source audio editing software, Audacity, with a minimal amount of developer effort.

Paper
Add Code

Unsupervised Source Separation By Steering Pretrained Music Models

1 code implementation • 25 Oct 2021 • Ethan Manilow, Patrick O'Reilly, Prem Seetharaman, Bryan Pardo

We showcase an unsupervised method that repurposes deep models trained for music generation and music tagging for audio source separation, without any retraining.

Audio Generation Audio Source Separation +3

Paper
Code

Neural Pitch-Shifting and Time-Stretching with Controllable LPCNet

1 code implementation • 5 Oct 2021 • Max Morrison, Zeyu Jin, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo

Modifying the pitch and timing of an audio signal are fundamental audio editing operations with applications in speech manipulation, audio-visual synchronization, and singing voice editing and synthesis.

Audio-Visual Synchronization

137

Paper
Code

Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition

1 code implementation • 14 Jul 2021 • Hugo Flores Garcia, Aldo Aguilar, Ethan Manilow, Bryan Pardo

Deep learning work on musical instrument recognition has generally focused on instrument classes for which we have abundant data.

Few-Shot Learning Instrument Recognition

Paper
Code

Context-Aware Prosody Correction for Text-Based Speech Editing

no code implementations • 16 Feb 2021 • Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo

Text-based speech editors expedite the process of editing speech recordings by permitting editing via intuitive cut, copy, and paste operations on a speech transcript.

Denoising

Paper
Add Code

A Study of Transfer Learning in Music Source Separation

no code implementations • 23 Oct 2020 • Andreas Bugler, Bryan Pardo, Prem Seetharaman

Supervised deep learning methods for performing audio source separation can be very effective in domains where there is a large amount of training data.

Audio Source Separation Data Augmentation +2

Paper
Add Code

Bespoke Neural Networks for Score-Informed Source Separation

no code implementations • 29 Sep 2020 • Ethan Manilow, Bryan Pardo

In this paper, we introduce a simple method that can separate arbitrary musical instruments from an audio mixture.

Paper
Add Code

AutoClip: Adaptive Gradient Clipping for Source Separation Networks

1 code implementation • 25 Jul 2020 • Prem Seetharaman, Gordon Wichern, Bryan Pardo, Jonathan Le Roux

Clipping the gradient is a known approach to improving gradient descent, but requires hand selection of a clipping threshold hyperparameter.

Audio Source Separation

107

Paper
Code

Bach or Mock? A Grading Function for Chorales in the Style of J.S. Bach

1 code implementation • 23 Jun 2020 • Alexander Fang, Alisa Liu, Prem Seetharaman, Bryan Pardo

Deep generative systems that learn probabilistic models from a corpus of existing music do not explicitly encode knowledge of a musical style, compared to traditional rule-based systems.

Paper
Code

Incorporating Music Knowledge in Continual Dataset Augmentation for Music Generation

1 code implementation • 23 Jun 2020 • Alisa Liu, Alexander Fang, Gaëtan Hadjeres, Prem Seetharaman, Bryan Pardo

In this paper, we present augmentative generation (Aug-Gen), a method of dataset augmentation for any music generation system trained on a resource-constrained domain.

Music Generation

Paper
Code

OtoMechanic: Auditory Automobile Diagnostics via Query-by-Example

no code implementations • 5 Nov 2019 • Max Morrison, Bryan Pardo

Many automobile components in need of repair produce characteristic sounds.

Paper
Add Code

Bootstrapping deep music separation from primitive auditory grouping principles

no code implementations • 23 Oct 2019 • Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo

They are trained on synthetic mixtures of audio made from isolated sound source recordings so that ground truth for the separation is known.

Music Source Separation

Paper
Add Code

Model selection for deep audio source separation via clustering analysis

no code implementations • 23 Oct 2019 • Alisa Liu, Prem Seetharaman, Bryan Pardo

We compare our confidence-based ensemble approach to using individual models with no selection, to an oracle that always selects the best model and to a random model selector.

Audio Source Separation Clustering +1

Paper
Add Code

Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments

no code implementations • 22 Oct 2019 • Ethan Manilow, Prem Seetharaman, Bryan Pardo

We present a single deep learning architecture that can both separate an audio recording of a musical mixture into constituent single-instrument recordings and transcribe these instruments into a human-readable format at the same time, learning a shared musical representation for both tasks.

Paper
Add Code

Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures

no code implementations • 6 Nov 2018 • Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo

These estimates, together with a weighting scheme in the time-frequency domain, based on confidence in the separation quality, are used to train a deep learning model that can be used for single-channel separation, where no source direction information is available.

Clustering Image Segmentation +2

Paper
Add Code

VocalSet: A Singing Voice Dataset

no code implementations • International Society for Music Information Retrieval Conference 2018 • Julia Wilkins, Prem Seetharaman, Alison Wahl, Bryan Pardo

We present VocalSet, a singing voice dataset of a capella singing.

Singer Identification Vocal technique classification

Paper
Add Code

An Overview of Lead and Accompaniment Separation in Music

no code implementations • 23 Apr 2018 • Zafar Rafii, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry FitzGerald, Bryan Pardo

For model-based methods, we organize them according to whether they concentrate on the lead signal, the accompaniment, or both.

Sound Audio and Speech Processing

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.