Browse > Speech > Text-To-Speech Synthesis

Text-To-Speech Synthesis

11 papers with code ยท Speech

Leaderboards

Latest papers without code

Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech

28 Nov 2019

We propose a Text-to-Speech method to create an unseen expressive style using one utterance of expressive speech of around one second.

SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

Independent and automatic evaluation of acoustic-to-articulatory inversion models

15 Nov 2019

Reconstruction of articulatory trajectories from the acoustic speech signal has been proposed for improving speech recognition and text-to-speech synthesis.

SPEECH RECOGNITION SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis

11 Nov 2019

In Mandarin text-to-speech (TTS) system, the front-end text processing module significantly influences the intelligibility and naturalness of synthesized speech.

SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework

7 Nov 2019

Text-to-speech synthesis (TTS) has witnessed rapid progress in recent years, where neural methods became capable of producing audio with near human-level naturalness.

SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach

14 Oct 2019

Finally, we focus on the last one, with the last techniques modeling Text-to-Speech synthesis as a sequence-to-sequence problem.

SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs

9 Sep 2019

We compare the results obtained from evaluating sentences in isolation, evaluating whole paragraphs of speech, and presenting a selection of speech or text as context and evaluating the subsequent speech.

SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain

3 Jun 2019

Previously, a machine speech chain, which is based on sequence-to-sequence deep learning, was proposed to mimic speech perception and production behavior.

DATA AUGMENTATION IMAGE CAPTIONING IMAGE RETRIEVAL SPEECH RECOGNITION SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

Neural Text Normalization with Subword Units

NAACL 2019

We find subword models with additional linguistic features yield the best performance (with a word error rate of 0. 17{\%}).

MACHINE TRANSLATION SPEECH RECOGNITION SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

NAACL 2019

Neural text-to-speech synthesis (NTTS) models have shown significant progress in generating high-quality speech, however they require a large quantity of training data.

SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS WORD EMBEDDINGS