Search Results for author: Brian McFee

Found 13 papers, 8 papers with code

Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation

1 code implementation23 Oct 2024 Junwon Lee, Modan Tailleur, Laurie M. Heller, Keunwoo Choi, Mathieu Lagrange, Brian McFee, Keisuke Imoto, Yuki Okamoto

Despite significant advancements in neural text-to-audio generation, challenges persist in controllability and evaluation.

Audio Generation

Leveraging Geometrical Acoustic Simulations of Spatial Room Impulse Responses for Improved Sound Event Detection and Localization

1 code implementation6 Sep 2023 Christopher Ick, Brian McFee

As deeper and more complex models are developed for the task of sound event localization and detection (SELD), the demand for annotated spatial audio data continues to increase.

Event Detection Sound Event Detection +1

Transfer Learning and Bias Correction with Pre-trained Audio Embeddings

1 code implementation20 Jul 2023 Changhong Wang, Gaël Richard, Brian McFee

This approach allows representations derived for one task to be applied to another, and can result in high accuracy with less stringent training data requirements for the downstream task.

Information Retrieval Instrument Recognition +3

A Proposal for Foley Sound Synthesis Challenge

no code implementations21 Jul 2022 Keunwoo Choi, Sangshin Oh, Minsung Kang, Brian McFee

"Foley" refers to sound effects that are added to multimedia during post-production to enhance its perceived acoustic properties, e. g., by simulating the sounds of footsteps, ambient environmental sounds, or visible objects on the screen.

Sound Event Detection in Urban Audio With Single and Multi-Rate PCEN

no code implementations6 Feb 2021 Christopher Ick, Brian McFee

Recent literature has demonstrated that the use of per-channel energy normalization (PCEN), has significant performance improvements over traditional log-scaled mel-frequency spectrograms in acoustic sound event detection (SED) in a multi-class setting with overlapping events.

Event Detection Sound Event Detection

Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks

1 code implementation9 Sep 2020 Helena Cuesta, Brian McFee, Emilia Gómez

This paper addresses the extraction of multiple F0 values from polyphonic and a cappella vocal performances using convolutional neural networks (CNNs).

Learning the helix topology of musical pitch

1 code implementation22 Oct 2019 Vincent Lostanlen, Sripathi Sridhar, Brian McFee, Andrew Farnsworth, Juan Pablo Bello

To explain the consonance of octaves, music psychologists represent pitch as a helix where azimuth and axial coordinate correspond to pitch class and pitch height respectively.

Multitask Learning for Fundamental Frequency Estimation in Music

1 code implementation2 Sep 2018 Rachel M. Bittner, Brian McFee, Juan P. Bello

Fundamental frequency (f0) estimation from polyphonic music includes the tasks of multiple-f0, melody, vocal, and bass line estimation.

Adaptive pooling operators for weakly labeled sound event detection

2 code implementations26 Apr 2018 Brian McFee, Justin Salamon, Juan Pablo Bello

In this work, we treat SED as a multiple instance learning (MIL) problem, where training labels are static over a short excerpt, indicating the presence or absence of sound sources but not their temporal locality.

Event Detection Multiple Instance Learning +2

Codebook based Audio Feature Representation for Music Information Retrieval

no code implementations19 Dec 2013 Yonatan Vaizman, Brian McFee, Gert Lanckriet

Automated recommendation systems are essential for users to discover music they love and for artists to reach appropriate audience.

Information Retrieval Management +5

Cannot find the paper you are looking for? You can Submit a new open access paper.