Search Results for author: Nicola Pia

Found 6 papers, 1 papers with code

SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement

no code implementations • 4 Dec 2023 • Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler

This paper proposes SEFGAN, a Deep Neural Network (DNN) combining maximum likelihood training and Generative Adversarial Networks (GANs) for efficient speech enhancement (SE).

Audio Generation Speech Enhancement

Paper
Add Code

Low-Resource Text-to-Speech Using Specific Data and Noise Augmentation

no code implementations • 16 Jun 2023 • Kishor Kayyar Lakshminarayana, Christian Dittmar, Nicola Pia, Emanuël Habets

These architectures must be trained with tens of hours of annotated and high-quality speech data.

Data Augmentation

Paper
Add Code

NESC: Robust Neural End-2-End Speech Coding with GANs

no code implementations • 7 Jul 2022 • Nicola Pia, Kishan Gupta, Srikanth Korse, Markus Multrus, Guillaume Fuchs

Neural networks have proven to be a formidable tool to tackle the problem of speech coding at very low bit rates.

Paper
Add Code

PostGAN: A GAN-Based Post-Processor to Enhance the Quality of Coded Speech

no code implementations • 31 Jan 2022 • Srikanth Korse, Nicola Pia, Kishan Gupta, Guillaume Fuchs

In order to mitigate these coding artefacts and enhance the quality of coded speech, a post-processor that relies on a-priori information transmitted from the encoder is traditionally employed at the decoder side.

Paper
Add Code

A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate

no code implementations • 9 Aug 2021 • Ahmed Mustafa, Jan Büthe, Srikanth Korse, Kishan Gupta, Guillaume Fuchs, Nicola Pia

Recently, GAN vocoders have seen rapid progress in speech synthesis, starting to outperform autoregressive models in perceptual quality with much higher generation speed.

Speech Synthesis

Paper
Add Code

StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization

2 code implementations • 3 Nov 2020 • Ahmed Mustafa, Nicola Pia, Guillaume Fuchs

In recent years, neural vocoders have surpassed classical speech generation approaches in naturalness and perceptual quality of the synthesized speech.

Spectral Reconstruction Vocal Bursts Intensity Prediction

10,167

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.