Search Results for author: Edresson Casanova

Found 9 papers, 7 papers with code

A single speaker is almost all you need for automatic speech recognition

1 code implementation29 Mar 2022 Edresson Casanova, Christopher Shulby, Alexander Korolev, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Aluísio, Moacir Antonelli Ponti

We explore the use of speech synthesis and voice conversion applied to augment datasets for automatic speech recognition (ASR) systems, in scenarios with only one speaker available for the target language.

Automatic Speech Recognition Data Augmentation +2

Brazilian Portuguese Speech Recognition Using Wav2vec 2.0

1 code implementation23 Jul 2021 Lucas Rafael Stefanel Gris, Edresson Casanova, Frederico Santos de Oliveira, Anderson da Silva Soares, Arnaldo Candido Junior

In this sense, this work presents the development of an public Automatic Speech Recognition (ASR) system using only open available audio data, from the fine-tuning of the Wav2vec 2. 0 XLSR-53 model pre-trained in many languages, over BP data.

Automatic Speech Recognition

SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

2 code implementations2 Apr 2021 Edresson Casanova, Christopher Shulby, Eren Gölge, Nicolas Michael Müller, Frederico Santos de Oliveira, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Maria Aluisio, Moacir Antonelli Ponti

In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training.

Cannot find the paper you are looking for? You can Submit a new open access paper.