Search Results for author: Tuomo Raitio

Found 8 papers, 0 papers with code

Improving the quality of neural TTS using long-form content and multi-speaker multi-style modeling

no code implementations • 20 Dec 2022 • Tuomo Raitio, Javier Latorre, Andrea Davis, Tuuli Morrill, Ladan Golipour

Neural text-to-speech (TTS) can provide quality close to natural speech if an adequate amount of high-quality speech material is available for training.

Paper
Add Code

Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise

no code implementations • 20 Mar 2022 • Tuomo Raitio, Petko Petkov, Jiangchuan Li, Muhammed Shifas, Andrea Davis, Yannis Stylianou

We present a neural text-to-speech (TTS) method that models natural vocal effort variation to improve the intelligibility of synthetic speech in the presence of noise.

Paper
Add Code

Hierarchical prosody modeling and control in non-autoregressive parallel neural TTS

no code implementations • 6 Oct 2021 • Tuomo Raitio, Jiangchuan Li, Shreyas Seshadri

Neural text-to-speech (TTS) synthesis can generate speech that is indistinguishable from natural speech.

Paper
Add Code

Emphasis control for parallel neural TTS

no code implementations • 6 Oct 2021 • Shreyas Seshadri, Tuomo Raitio, Dan Castellani, Jiangchuan Li

Recent parallel neural text-to-speech (TTS) synthesis methods are able to generate speech with high fidelity while maintaining high performance.

Sentence

Paper
Add Code

On-device neural speech synthesis

no code implementations • 17 Sep 2021 • Sivanand Achanta, Albert Antony, Ladan Golipour, Jiangchuan Li, Tuomo Raitio, Ramya Rasipuram, Francesco Rossi, Jennifer Shi, Jaimin Upadhyay, David Winarsky, Hepeng Zhang

Recent advances in text-to-speech (TTS) synthesis, such as Tacotron and WaveRNN, have made it possible to construct a fully neural network based TTS system, by coupling the two components together.

Speech Synthesis

Paper
Add Code

Whispered and Lombard Neural Speech Synthesis

no code implementations • 13 Jan 2021 • Qiong Hu, Tobias Bleisch, Petko Petkov, Tuomo Raitio, Erik Marchi, Varun Lakshminarasimhan

2) Although our speaker verification (SV) model is not explicitly trained to discriminate different speaking styles, and no Lombard and whisper voice is used for pre-training this system, the SV model can be used as a style encoder for generating different style embeddings as input for the Tacotron system.

Speaker Verification Speech Synthesis

Paper
Add Code

Controllable neural text-to-speech synthesis using intuitive prosodic features

no code implementations • 14 Sep 2020 • Tuomo Raitio, Ramya Rasipuram, Dan Castellani

Modern neural text-to-speech (TTS) synthesis can generate speech that is indistinguishable from natural speech.

Sentence Speech Synthesis +1

Paper
Add Code

Parametric Representation for Singing Voice Synthesis: a Comparative Evaluation

no code implementations • 7 Jun 2020 • Onur Babacan, Thomas Drugman, Tuomo Raitio, Daniel Erro, Thierry Dutoit

Various parametric representations have been proposed to model the speech signal.

Singing Voice Synthesis

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.