Search Results for author: Jonas Rohnke

Found 5 papers, 0 papers with code

Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech

no code implementations24 Oct 2021 Marek Strong, Jonas Rohnke, Antonio Bonafonte, Mateusz Łajszczak, Trevor Wood

We present a Split Vector Quantized Variational Autoencoder (SVQ-VAE) architecture using a split vector quantizer for NTTS, as an enhancement to the well-known Variational Autoencoder (VAE) and Vector Quantized Variational Autoencoder (VQ-VAE) architectures.

Parallel WaveNet conditioned on VAE latent vectors

no code implementations17 Dec 2020 Jonas Rohnke, Tom Merritt, Jaime Lorenzo-Trueba, Adam Gabrys, Vatsal Aggarwal, Alexis Moinet, Roberto Barra-Chicote

In this paper we investigate the use of a sentence-level conditioning vector to improve the signal quality of a Parallel WaveNet neural vocoder.

Sentence Speech Synthesis +1

Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection

no code implementations2 Dec 2019 Shubhi Tyagi, Marco Nicolis, Jonas Rohnke, Thomas Drugman, Jaime Lorenzo-Trueba

Recent advances in Text-to-Speech (TTS) have improved quality and naturalness to near-human capabilities when considering isolated sentences.

Speech Synthesis

Fine-grained robust prosody transfer for single-speaker neural text-to-speech

no code implementations4 Jul 2019 Viacheslav Klimkov, Srikanth Ronanki, Jonas Rohnke, Thomas Drugman

However, when trained on a single-speaker dataset, the conventional prosody transfer systems are not robust enough to speaker variability, especially in the case of a reference signal coming from an unseen speaker.

Cannot find the paper you are looking for? You can Submit a new open access paper.