no code implementations • 4 Dec 2023 • Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler
This paper proposes SEFGAN, a Deep Neural Network (DNN) combining maximum likelihood training and Generative Adversarial Networks (GANs) for efficient speech enhancement (SE).
no code implementations • 16 Jun 2023 • Kishor Kayyar Lakshminarayana, Christian Dittmar, Nicola Pia, Emanuël Habets
These architectures must be trained with tens of hours of annotated and high-quality speech data.
no code implementations • 7 Jul 2022 • Nicola Pia, Kishan Gupta, Srikanth Korse, Markus Multrus, Guillaume Fuchs
Neural networks have proven to be a formidable tool to tackle the problem of speech coding at very low bit rates.
no code implementations • 31 Jan 2022 • Srikanth Korse, Nicola Pia, Kishan Gupta, Guillaume Fuchs
In order to mitigate these coding artefacts and enhance the quality of coded speech, a post-processor that relies on a-priori information transmitted from the encoder is traditionally employed at the decoder side.
no code implementations • 9 Aug 2021 • Ahmed Mustafa, Jan Büthe, Srikanth Korse, Kishan Gupta, Guillaume Fuchs, Nicola Pia
Recently, GAN vocoders have seen rapid progress in speech synthesis, starting to outperform autoregressive models in perceptual quality with much higher generation speed.
2 code implementations • 3 Nov 2020 • Ahmed Mustafa, Nicola Pia, Guillaume Fuchs
In recent years, neural vocoders have surpassed classical speech generation approaches in naturalness and perceptual quality of the synthesized speech.