Search Results for author: Mark Cartwright

Found 5 papers, 2 papers with code

A General Framework for Learning Procedural Audio Models of Environmental Sounds

no code implementations4 Mar 2023 Danzel Serrano, Mark Cartwright

This paper introduces the Procedural (audio) Variational autoEncoder (ProVE) framework as a general approach to learning Procedural Audio PA models of environmental sounds with an improvement to the realism of the synthesis while maintaining provision of control over the generated sound through adjustable parameters.

FAD

A Study on Robustness to Perturbations for Representations of Environmental Sound

no code implementations20 Mar 2022 Sangeeta Srivastava, Ho-Hsiang Wu, Joao Rulff, Magdalena Fuentes, Mark Cartwright, Claudio Silva, Anish Arora, Juan Pablo Bello

To accomplish this, we imitate channel effects by injecting perturbations to the audio signal and measure the shift in the new (perturbed) embeddings with three distance measures, making the evaluation domain-dependent but not task-dependent.

FAD Transfer Learning

Weakly Supervised Source-Specific Sound Level Estimation in Noisy Soundscapes

1 code implementation6 May 2021 Aurora Cramer, Mark Cartwright, Fatemeh Pishdadian, Juan Pablo Bello

While the estimation of what sound sources are, when they occur, and from where they originate has been well-studied, the estimation of how loud these sound sources are has been often overlooked.

SONYC-UST-V2: An Urban Sound Tagging Dataset with Spatiotemporal Context

no code implementations11 Sep 2020 Mark Cartwright, Jason Cramer, Ana Elisa Mendez Mendez, Yu Wang, Ho-Hsiang Wu, Vincent Lostanlen, Magdalena Fuentes, Graham Dove, Charlie Mydlarz, Justin Salamon, Oded Nov, Juan Pablo Bello

In this article, we describe our data collection procedure and propose evaluation metrics for multilabel classification of urban sound tags.

Cannot find the paper you are looking for? You can Submit a new open access paper.