Zero-Shot Multi-Speaker TTS

3 papers with code • 0 benchmarks • 1 dataset

Most implemented papers

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

coqui-ai/TTS 4 Dec 2021

YourTTS brings the power of a multilingual approach to the task of zero-shot multi-speaker TTS.

Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS

ogunlao/glowtts_stdp 28 May 2023

Flow-based generative models are widely used in text-to-speech (TTS) systems to learn the distribution of audio features (e.g., Mel-spectrograms) given the input tokens and to sample from this distribution to generate diverse utterances.

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model

Edresson/ZS-TTS-Evaluation 7 Jun 2024

Most Zero-shot Multi-speaker TTS (ZS-TTS) systems support only a single language.