Search Results for author: Max Morrison

Found 10 papers, 6 papers with code

High-Fidelity Neural Phonetic Posteriorgrams

1 code implementation27 Feb 2024 Cameron Churchwell, Max Morrison, Bryan Pardo

A phonetic posteriorgram (PPG) is a time-varying categorical distribution over acoustic units of speech (e. g., phonemes).

Voice Conversion

Crowdsourced and Automatic Speech Prominence Estimation

1 code implementation12 Oct 2023 Max Morrison, Pranav Pawar, Nathan Pruyne, Jennifer Cole, Bryan Pardo

Speech prominence estimation is the process of assigning a numeric value to the prominence of each word in an utterance.

Emotion Recognition

Music Separation Enhancement with Generative Modeling

no code implementations26 Aug 2022 Noah Schaffer, Boaz Cogan, Ethan Manilow, Max Morrison, Prem Seetharaman, Bryan Pardo

Despite phenomenal progress in recent years, state-of-the-art music separation systems produce source estimates with significant perceptual shortcomings, such as adding extraneous noise or removing harmonics.

Music Source Separation

Reproducible Subjective Evaluation

1 code implementation8 Mar 2022 Max Morrison, Brian Tang, Gefei Tan, Bryan Pardo

ReSEval lets researchers launch A/B, ABX, Mean Opinion Score (MOS) and MUltiple Stimuli with Hidden Reference and Anchor (MUSHRA) tests on audio, image, text, or video data from a command-line interface or using one line of Python, making it as easy to run as objective evaluation.

Chunked Autoregressive GAN for Conditional Waveform Synthesis

1 code implementation ICLR 2022 Max Morrison, Rithesh Kumar, Kundan Kumar, Prem Seetharaman, Aaron Courville, Yoshua Bengio

We show that simple pitch and periodicity conditioning is insufficient for reducing this error relative to using autoregression.

Inductive Bias

Neural Pitch-Shifting and Time-Stretching with Controllable LPCNet

1 code implementation5 Oct 2021 Max Morrison, Zeyu Jin, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo

Modifying the pitch and timing of an audio signal are fundamental audio editing operations with applications in speech manipulation, audio-visual synchronization, and singing voice editing and synthesis.

Audio-Visual Synchronization

Context-Aware Prosody Correction for Text-Based Speech Editing

no code implementations16 Feb 2021 Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo

Text-based speech editors expedite the process of editing speech recordings by permitting editing via intuitive cut, copy, and paste operations on a speech transcript.

Denoising

Controllable Neural Prosody Synthesis

no code implementations7 Aug 2020 Max Morrison, Zeyu Jin, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore

Speech synthesis has recently seen significant improvements in fidelity, driven by the advent of neural vocoders and neural prosody generators.

Speech Synthesis

OtoMechanic: Auditory Automobile Diagnostics via Query-by-Example

no code implementations5 Nov 2019 Max Morrison, Bryan Pardo

Many automobile components in need of repair produce characteristic sounds.

Cannot find the paper you are looking for? You can Submit a new open access paper.