Browse > Speech > Speech Synthesis

Speech Synthesis

36 papers with code ยท Speech

Speech synthesis is the task of generating speech from text.

Please note that the leaderboards here are not really comparable between studies - as they use mean opinion score as a metric and collect different samples from Amazon Mechnical Turk.

( Image credit: WaveNet: A generative model for raw audio )

Leaderboards

Latest papers without code

Excitation-based Voice Quality Analysis and Modification

2 Jan 2020

This paper investigates the differences occuring in the excitation for different voice qualities.

SPEECH SYNTHESIS

Eigenresiduals for improved Parametric Speech Synthesis

2 Jan 2020

Statistical parametric speech synthesizers have recently shown their ability to produce natural-sounding and flexible voices.

SPEECH SYNTHESIS

High Fidelity Speech Synthesis with Adversarial Networks

ICLR 2020

However, their application in the audio domain has received limited attention, and autoregressive models, such as WaveNet, remain the state of the art in generative modelling of audio signals such as human speech.

SPEECH SYNTHESIS

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

ICLR 2020

Recent work has explored sequence-to-sequence latent variable models for expressive speech synthesis (supporting control and transfer of prosody and style), but has not presented a coherent framework for understanding the trade-offs between the competing methods.

LATENT VARIABLE MODELS SPEECH SYNTHESIS STYLE TRANSFER

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

ICLR 2020

We present a novel generative model that combines state-of-the-art neural text- to-speech (TTS) with semi-supervised probabilistic latent variable models.

LATENT VARIABLE MODELS SPEECH SYNTHESIS

Attention Forcing for Sequence-to-sequence Model Training

ICLR 2020

This paper introduces attention forcing, which guides the model with generated output history and reference attention.

MACHINE TRANSLATION SPEECH SYNTHESIS

Using a Pitch-Synchronous Residual Codebook for Hybrid HMM/Frame Selection Speech Synthesis

30 Dec 2019

The source signal is obtained by concatenating excitation frames picked up from the codebook, based on a selection criterion and taking target residual coefficients as input.

SPEECH SYNTHESIS

A Deterministic plus Stochastic Model of the Residual Signal for Improved Parametric Speech Synthesis

29 Dec 2019

For this, we hereby propose an adaptation of the Deterministic plus Stochastic Model (DSM) for the residual.

SPEECH SYNTHESIS

The Deterministic plus Stochastic Model of the Residual Signal and its Applications

29 Dec 2019

The applicability of the DSM in two fields of speech processing is then studied.

SPEAKER IDENTIFICATION SPEECH SYNTHESIS

Learning Singing From Speech

20 Dec 2019

The proposed algorithm first integrate speech and singing synthesis into a unified framework, and learns universal speaker embeddings that are shareable between speech and singing synthesis tasks.

SPEECH SYNTHESIS VOICE CONVERSION