1 code implementation • 29 Mar 2022 • Edresson Casanova, Christopher Shulby, Alexander Korolev, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Aluísio, Moacir Antonelli Ponti
We explore the use of speech synthesis and voice conversion applied to augment datasets for automatic speech recognition (ASR) systems, in scenarios with only one speaker available for the target language.
2 code implementations • 4 Dec 2021 • Edresson Casanova, Julian Weber, Christopher Shulby, Arnaldo Candido Junior, Eren Gölge, Moacir Antonelli Ponti
YourTTS brings the power of a multilingual approach to the task of zero-shot multi-speaker TTS.
1 code implementation • 6 Sep 2021 • Moacir Antonelli Ponti, Fernando Pereira dos Santos, Leo Sampaio Ferraz Ribeiro, Gabriel Biscaro Cavallari
Training deep neural networks may be challenging in real world data.
2 code implementations • 2 Apr 2021 • Edresson Casanova, Christopher Shulby, Eren Gölge, Nicolas Michael Müller, Frederico Santos de Oliveira, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Maria Aluisio, Moacir Antonelli Ponti
In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training.
1 code implementation • 11 May 2020 • Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, João Paulo Teixeira, Moacir Antonelli Ponti, Sandra Maria Aluisio
Speech provides a natural way for human-computer interaction.
1 code implementation • 25 Feb 2020 • Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, Lucas Rafael Stefanel Gris, Hamilton Pereira da Silva, Sandra Maria Aluisio, Moacir Antonelli Ponti
We compare the three best architectures trained using our method to select the best one, which is the one with a shallow architecture.
no code implementations • 28 Jan 2019 • Fernando Pereira dos Santos, Leonardo Sampaio Ferraz Ribeiro, Moacir Antonelli Ponti
By proposing novel cross-domain generalization measures, we study how source features can generalize for different target video domains, as well as analyze unsupervised transfer learning.
no code implementations • 1 Nov 2018 • Gabriel B. Cavallari, Leonardo Sampaio Ferraz Ribeiro, Moacir Antonelli Ponti
The models are evaluated considering both the reconstruction error of the images and the feature spaces in terms of their discriminative power.
no code implementations • 20 Jun 2018 • Moacir Antonelli Ponti, Gabriel B. Paranhos da Costa
Deep Learning methods are currently the state-of-the-art in many problems which can be tackled via machine learning, in particular classification problems.
no code implementations • 7 May 2018 • Rodrigo Fernandes de Mello, Moacir Antonelli Ponti, Carlos Henrique Grossi Ferreira
The Statistical Learning Theory (SLT) provides the theoretical guarantees for supervised machine learning based on the Empirical Risk Minimization Principle (ERMP).
no code implementations • 28 Nov 2017 • Rodrigo Fernandes de Mello, Martha Dais Ferreira, Moacir Antonelli Ponti
Deep Learning (DL) is one of the most common subjects when Machine Learning and Data Science approaches are considered.