no code implementations • 21 Jul 2021 • Joanna Rownicka, Kilian Sprenkamp, Antonio Tripiana, Volodymyr Gromoglasov, Timo P Kunz
We describe our approach to create and deliver a custom voice for a conversational AI use-case.
2 code implementations • 28 Feb 2020 • Jennifer Williams, Joanna Rownicka, Pilar Oplustil, Simon King
Our NN predicts MOS with a high correlation to human judgments.
no code implementations • 31 Oct 2019 • Joanna Rownicka, Peter Bell, Steve Renals
We propose a multi-scale octave convolution layer to learn robust speech representations efficiently.
no code implementations • 30 Sep 2019 • Joanna Rownicka, Peter Bell, Steve Renals
In this work, we investigate the use of embeddings for speaker-adaptive training of DNNs (DNN-SAT) focusing on a small amount of adaptation data per speaker.
no code implementations • 23 Sep 2019 • Jennifer Williams, Joanna Rownicka
Our system used convolutional neural networks (CNNs) and a representation of the speech audio that combined x-vector attack embeddings with signal processing features.
no code implementations • 12 Nov 2018 • Joanna Rownicka, Peter Bell, Steve Renals
We analyze the representations learned by deep CNNs and compare them with deep neural network (DNN) representations and i-vectors, in the context of acoustic model adaptation.