
Speech Enhancement

10 papers with code · Speech

Speech enhancement is the task of taking a noisy speech input and producing an enhanced speech output.

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Greatest papers with code

Building state-of-the-art distant speech recognition using the CHiME-4 challenge with a setup of speech enhancement baseline

27 Mar 2018 · kaldi-asr/kaldi

This paper describes a new baseline system for automatic speech recognition (ASR) in the CHiME-4 challenge to promote the development of noisy ASR in speech processing communities by providing 1) a state-of-the-art system with a simplified single system comparable to the complicated top systems in the challenge, and 2) a publicly available and reproducible recipe through the main repository in the Kaldi speech recognition toolkit. In addition, the proposed baseline recipe includes four different speech enhancement measures for the simulation test set: short-time objective intelligibility measure (STOI), extended STOI (eSTOI), perceptual evaluation of speech quality (PESQ) and speech distortion ratio (SDR).

DISTANT SPEECH RECOGNITION · LANGUAGE MODELLING · NOISY SPEECH RECOGNITION · SPEECH ENHANCEMENT
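Of the four measures named in the abstract above, SDR is the simplest to write down. The following is a minimal sketch, assuming the plain energy-ratio definition (commonly expanded as signal-to-distortion ratio) computed against a time-aligned clean reference; it is not the scoring code from the Kaldi recipe, and the function name `sdr_db` and the synthetic signals are hypothetical stand-ins for real recordings.

```python
import numpy as np


def sdr_db(reference, estimate, eps=1e-12):
    """Plain SDR in dB: 10 * log10(||s||^2 / ||s - s_hat||^2).

    Assumes `reference` (clean) and `estimate` (enhanced) are time-aligned
    and have the same length. This is an illustrative definition only.
    """
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    if reference.shape != estimate.shape:
        raise ValueError("reference and estimate must have the same shape")
    signal_power = np.sum(reference ** 2)
    distortion_power = np.sum((reference - estimate) ** 2)
    # eps guards against log(0) and division by zero for perfect estimates
    return float(10.0 * np.log10((signal_power + eps) / (distortion_power + eps)))


# Synthetic stand-ins for real audio (hypothetical data):
# a 220 Hz tone as "clean speech", plus white noise at two levels.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
clean = np.sin(2.0 * np.pi * 220.0 * t)
noisy = clean + 0.5 * rng.standard_normal(clean.shape)
enhanced = clean + 0.1 * rng.standard_normal(clean.shape)

sdr_noisy = sdr_db(clean, noisy)
sdr_enhanced = sdr_db(clean, enhanced)
print(f"SDR of noisy input:     {sdr_noisy:.1f} dB")
print(f"SDR of enhanced output: {sdr_enhanced:.1f} dB")
```

A higher value means less residual distortion relative to the clean signal, so an enhancement system is expected to raise SDR over the unprocessed input. STOI and PESQ model intelligibility and perceived quality and need dedicated implementations.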

SEGAN: Speech Enhancement Generative Adversarial Network

28 Mar 2017 · santi-pdp/segan

Most current speech enhancement techniques tackle a limited number of noise conditions and rely on first-order statistics. In contrast to these techniques, we operate at the waveform level, training the model end-to-end, and incorporate 28 speakers and 40 different noise conditions into the same model, such that model parameters are shared across them.

SPEECH ENHANCEMENT

Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks

31 Aug 2018 · santi-pdp/segan_pytorch

Most methods of voice restoration for patients suffering from aphonia produce either whispered or monotone speech. Apart from intelligibility, this type of speech lacks expressiveness and naturalness due to the absence of pitch (whispered speech) or artificial generation of it (monotone speech).

SPEECH ENHANCEMENT

Language and Noise Transfer in Speech Enhancement Generative Adversarial Network

18 Dec 2017 · santi-pdp/segan_pytorch

The adaptability of speech enhancement systems to new, low-resource environments is an important topic. In this work, we present the results of adapting a speech enhancement generative adversarial network by fine-tuning the generator with small amounts of data.

SPEECH ENHANCEMENT

Contaminated speech training methods for robust DNN-HMM distant speech recognition

10 Oct 2017 · mravanelli/pySpeechRev

Despite the significant progress made in the last years, state-of-the-art speech recognition technologies provide a satisfactory performance only in the close-talking condition. Robustness of distant speech recognition in adverse acoustic conditions, on the other hand, remains a crucial open issue for future applications of human-machine interaction.

DISTANT SPEECH RECOGNITION · SPEECH ENHANCEMENT

Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition

27 Mar 2018 · wangkenpu/rsrgan

We investigate the use of generative adversarial networks (GANs) in speech dereverberation for robust speech recognition. First, we study the effectiveness of different dereverberation networks (the generator in the GAN) and find that an LSTM leads to a significant improvement compared with a feed-forward DNN and a CNN on our dataset.

ROBUST SPEECH RECOGNITION · SPEECH ENHANCEMENT

Improved Speech Enhancement with the Wave-U-Net

27 Nov 2018 · craigmacartney/Wave-U-Net-For-Speech-Enhancement

We study the use of the Wave-U-Net architecture for speech enhancement, a model introduced by Stoller et al. for the separation of music vocals and accompaniment. This end-to-end learning method for audio source separation operates directly in the time domain, permitting the integrated modelling of phase information and taking large temporal contexts into account.

SPEECH ENHANCEMENT · SPEECH RECOGNITION

A Fully Convolutional Neural Network for Speech Enhancement

22 Sep 2016 · achaitu/SpeechDenoisingDNN

In hearing aids, the presence of babble noise greatly degrades the intelligibility of human speech. However, removing the babble without creating artifacts in human speech is a challenging task in a low SNR environment.

SPEECH ENHANCEMENT

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

6 Nov 2018 · dr-pato/audio_visual_speech_enhancement

In this paper, we address the problem of enhancing the speech of a speaker of interest in a cocktail party scenario when visual information of the speaker of interest is available. Contrary to most previous studies, we do not learn visual features on the typically small audio-visual datasets, but use an already available face landmark detector (trained on a separate image dataset).

SPEECH ENHANCEMENT

Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization

15 Sep 2017 · mohammadiha/bnmf

Reducing the interference noise in a monaural noisy speech signal has been a challenging task for many years. We propose a novel speech enhancement method that is based on a Bayesian formulation of NMF (BNMF).

DENOISING · SPEECH ENHANCEMENT
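For readers unfamiliar with NMF-based enhancement, the general idea can be sketched as follows. This is a minimal illustration of plain supervised NMF with Lee-Seung multiplicative updates, not the Bayesian formulation (BNMF) proposed in the paper and not code from the mohammadiha/bnmf repository. The function `nmf`, the toy "spectrograms", and all sizes are hypothetical; the toy is deliberately easy because speech and noise occupy disjoint frequency bins, whereas real signals overlap.

```python
import numpy as np


def nmf(V, rank, n_iter=200, W_fixed=None, seed=0, eps=1e-9):
    """Factor non-negative V (freq x time) as W @ H with multiplicative
    updates for the Euclidean cost. If W_fixed is given, only H is updated."""
    rng = np.random.default_rng(seed)
    if W_fixed is None:
        W = rng.random((V.shape[0], rank)) + eps
    else:
        W = W_fixed
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        if W_fixed is None:
            W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Toy magnitude "spectrograms" (hypothetical data, not real audio):
# speech energy lives in the low 8 bins, noise in the high 8 bins.
rng = np.random.default_rng(1)
n_freq, n_time, rank = 16, 60, 3
B_speech = np.zeros((n_freq, rank))
B_speech[:8] = rng.random((8, rank))
B_noise = np.zeros((n_freq, rank))
B_noise[8:] = rng.random((8, rank))

speech_train = B_speech @ rng.random((rank, n_time))
noise_train = B_noise @ rng.random((rank, n_time))
speech_test = B_speech @ rng.random((rank, n_time))
noise_test = B_noise @ rng.random((rank, n_time))
mixture = speech_test + noise_test

# 1) Learn one basis per source from its own training data.
W_s, _ = nmf(speech_train, rank, seed=2)
W_n, _ = nmf(noise_train, rank, seed=3)

# 2) Explain the mixture with both bases held fixed.
W = np.hstack([W_s, W_n])
_, H = nmf(mixture, 2 * rank, W_fixed=W, seed=4)

# 3) Build a Wiener-like soft mask from the speech part of the model.
speech_part = W_s @ H[:rank]
mask = speech_part / (W @ H + 1e-9)
speech_est = mask * mixture

err_before = np.linalg.norm(mixture - speech_test)
err_after = np.linalg.norm(speech_est - speech_test)
print(f"error before enhancement: {err_before:.3f}")
print(f"error after enhancement:  {err_after:.3f}")
```

In the supervised setting sketched here, both bases are trained in advance. Unsupervised variants estimate the noise basis from the noisy signal itself, which this sketch does not attempt.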