Search Results for author: George Tzanetakis

Found 8 papers, 6 papers with code

Estimating Visual Information From Audio Through Manifold Learning

1 code implementation • 3 Aug 2022 • Fabrizio Pedersoli, Dryden Wiebe, Amin Banitalebi, Yong Zhang, George Tzanetakis, Kwang Moo Yi

Therefore, audio-based methods can be useful even for applications in which only visual information is of interest Our framework is based on Manifold Learning and consists of two steps.

Semantic Segmentation

Paper
Code

HEAR: Holistic Evaluation of Audio Representations

3 code implementations • 6 Mar 2022 • Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk

The aim of the HEAR benchmark is to develop a general-purpose audio representation that provides a strong basis for learning in a wide variety of tasks and scenarios.

Open-Ended Question Answering

Paper
Code

One Billion Audio Sounds from GPU-enabled Modular Synthesis

1 code implementation • 27 Apr 2021 • Joseph Turian, Jordie Shier, George Tzanetakis, Kirk McNally, Max Henry

We release synth1B1, a multi-modal audio corpus consisting of 1 billion 4-second synthesized sounds, paired with the synthesis parameters used to generate them.

Hyperparameter Optimization

319

Paper
Code

Deep Autotuner: a Pitch Correcting Network for Singing Performances

1 code implementation • 12 Feb 2020 • Sanna Wager, George Tzanetakis, Cheng-i Wang, Minje Kim

We train our neural network model using a dataset of 4, 702 amateur karaoke performances selected for good intonation.

144

Paper
Code

Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances

no code implementations • 3 Feb 2019 • Sanna Wager, George Tzanetakis, Cheng-i Wang, Lijiang Guo, Aswin Sivaraman, Minje Kim

This approach differs from commercially used automatic pitch correction systems, where notes in the vocal tracks are shifted to be centered around notes in a user-defined score or mapped to the closest pitch among the twelve equal-tempered scale degrees.

Paper
Add Code

Espresso: Efficient Forward Propagation for Binary Deep Neural Networks

no code implementations • ICLR 2018 • Fabrizio Pedersoli, George Tzanetakis, Andrea Tagliasacchi

Binary Deep Neural Networks (BDNNs) have been shown to be an effective way of achieving this objective.

Paper
Add Code

Espresso: Efficient Forward Propagation for BCNNs

1 code implementation • 19 May 2017 • Fabrizio Pedersoli, George Tzanetakis, Andrea Tagliasacchi

In this paper, we show how Convolutional Neural Networks (CNNs) can be implemented using binary representations.

Paper
Code

The Orchive : Data mining a massive bioacoustic archive

1 code implementation • 2 Jul 2013 • Steven Ness, Helena Symonds, Paul Spong, George Tzanetakis

The Orchive is a large collection of over 20, 000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.