no code implementations • 31 Mar 2022 • Hubert Siuzdak, Piotr Dura, Pol van Rijn, Nori Jacoby
Recent advances in neural text-to-speech research have been dominated by two-stage pipelines utilizing low-level intermediate speech representation such as mel-spectrograms.