no code implementations • 15 Sep 2023 • Dariusz Piotrowski, Renard Korzeniowski, Alessio Falai, Sebastian Cygert, Kamil Pokora, Georgi Tinchev, Ziyao Zhang, Kayoko Yanagisawa
In the first two stages, we use a VC model to convert utterances in the target locale to the voice of the target speaker.