Search Results for author: George Tzanetakis

Found 8 papers, 6 papers with code

Estimating Visual Information From Audio Through Manifold Learning

1 code implementation3 Aug 2022 Fabrizio Pedersoli, Dryden Wiebe, Amin Banitalebi, Yong Zhang, George Tzanetakis, Kwang Moo Yi

Therefore, audio-based methods can be useful even for applications in which only visual information is of interest Our framework is based on Manifold Learning and consists of two steps.

Semantic Segmentation

One Billion Audio Sounds from GPU-enabled Modular Synthesis

1 code implementation27 Apr 2021 Joseph Turian, Jordie Shier, George Tzanetakis, Kirk McNally, Max Henry

We release synth1B1, a multi-modal audio corpus consisting of 1 billion 4-second synthesized sounds, paired with the synthesis parameters used to generate them.

Hyperparameter Optimization

Deep Autotuner: a Pitch Correcting Network for Singing Performances

1 code implementation12 Feb 2020 Sanna Wager, George Tzanetakis, Cheng-i Wang, Minje Kim

We train our neural network model using a dataset of 4, 702 amateur karaoke performances selected for good intonation.

Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances

no code implementations3 Feb 2019 Sanna Wager, George Tzanetakis, Cheng-i Wang, Lijiang Guo, Aswin Sivaraman, Minje Kim

This approach differs from commercially used automatic pitch correction systems, where notes in the vocal tracks are shifted to be centered around notes in a user-defined score or mapped to the closest pitch among the twelve equal-tempered scale degrees.

Espresso: Efficient Forward Propagation for Binary Deep Neural Networks

no code implementations ICLR 2018 Fabrizio Pedersoli, George Tzanetakis, Andrea Tagliasacchi

Binary Deep Neural Networks (BDNNs) have been shown to be an effective way of achieving this objective.

Espresso: Efficient Forward Propagation for BCNNs

1 code implementation19 May 2017 Fabrizio Pedersoli, George Tzanetakis, Andrea Tagliasacchi

In this paper, we show how Convolutional Neural Networks (CNNs) can be implemented using binary representations.

The Orchive : Data mining a massive bioacoustic archive

1 code implementation2 Jul 2013 Steven Ness, Helena Symonds, Paul Spong, George Tzanetakis

The Orchive is a large collection of over 20, 000 hours of audio recordings from the OrcaLab research facility located off the northern tip of Vancouver Island.

Cannot find the paper you are looking for? You can Submit a new open access paper.