Search Results for author: Dominik Roblek

Found 12 papers, 3 papers with code

One-shot conditional audio filtering of arbitrary sounds

no code implementations4 Nov 2020 Beat Gfeller, Dominik Roblek, Marco Tagliasacchi

When trained on Librispeech, our model achieves an SI-SDR improvement of 14. 0 dB when separating one voice from a mixture of two speakers.

SEANet: A Multi-modal Speech Enhancement Network

1 code implementation4 Sep 2020 Marco Tagliasacchi, Yunpeng Li, Karolis Misiunas, Dominik Roblek

We explore the possibility of leveraging accelerometer data to perform speech enhancement in very noisy conditions.

Speech Enhancement

Learning to Denoise Historical Music

no code implementations5 Aug 2020 Yunpeng Li, Beat Gfeller, Marco Tagliasacchi, Dominik Roblek

We propose an audio-to-audio neural network model that learns to denoise old music recordings.

Training Keyword Spotters with Limited and Synthesized Speech Data

no code implementations31 Jan 2020 James Lin, Kevin Kilgour, Dominik Roblek, Matthew Sharifi

With the rise of low power speech-enabled devices, there is a growing demand to quickly produce models for recognizing arbitrary sets of keywords.

Ranked #10 on Keyword Spotting on Google Speech Commands (Google Speech Commands V2 12 metric)

Keyword Spotting

SPICE: Self-supervised Pitch Estimation

no code implementations25 Oct 2019 Beat Gfeller, Christian Frank, Dominik Roblek, Matt Sharifi, Marco Tagliasacchi, Mihajlo Velimirović

We propose a model to estimate the fundamental frequency in monophonic audio, often referred to as pitch estimation.

Self-Supervised Learning Translation

Learning audio representations via phase prediction

no code implementations25 Oct 2019 Félix de Chaumont Quitry, Marco Tagliasacchi, Dominik Roblek

We learn audio representations by solving a novel self-supervised learning task, which consists of predicting the phase of the short-time Fourier transform from its magnitude.

Self-Supervised Learning

From Here to There: Video Inbetweening Using Direct 3D Convolutions

1 code implementation24 May 2019 Yunpeng Li, Dominik Roblek, Marco Tagliasacchi

We first obtain a latent video representation using a stochastic fusion mechanism that learns how to incorporate information from the start and end frames.

Video Generation

Self-supervised audio representation learning for mobile devices

no code implementations24 May 2019 Marco Tagliasacchi, Beat Gfeller, Félix de Chaumont Quitry, Dominik Roblek

We explore self-supervised models that can be potentially deployed on mobile devices to learn general purpose audio representations.

Federated Learning Representation Learning +2

An Empirical Study of Generative Models with Encoders

no code implementations19 Dec 2018 Paul K. Rubenstein, Yunpeng Li, Dominik Roblek

Generative adversarial networks (GANs) are capable of producing high quality image samples.

Now Playing: Continuous low-power music recognition

no code implementations29 Nov 2017 Blaise Agüera y Arcas, Beat Gfeller, Ruiqi Guo, Kevin Kilgour, Sanjiv Kumar, James Lyon, Julian Odell, Marvin Ritter, Dominik Roblek, Matthew Sharifi, Mihajlo Velimirović

To reduce battery consumption, a small music detector runs continuously on the mobile device's DSP chip and wakes up the main application processor only when it is confident that music is present.

Cannot find the paper you are looking for? You can Submit a new open access paper.