Browse > Speech > Speech Enhancement

Speech Enhancement

23 papers with code · Speech

Speech enhancement is the task of taking a noisy speech input and producing an enhanced speech output.

( Image credit: A Fully Convolutional Neural Network For Speech Enhancement )

Leaderboards

Greatest papers with code

SEGAN: Speech Enhancement Generative Adversarial Network

28 Mar 2017santi-pdp/segan

In contrast to current techniques, we operate at the waveform level, training the model end-to-end, and incorporate 28 speakers and 40 different noise conditions into the same model, such that model parameters are shared across them.

SPEECH ENHANCEMENT

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

11 Oct 2018mindslab-ai/voicefilter

In this paper, we present a novel system that separates the voice of a target speaker from multi-speaker signals, by making use of a reference signal from the target speaker.

SPEAKER RECOGNITION SPEAKER SEPARATION SPEECH ENHANCEMENT SPEECH RECOGNITION

Deep learning for minimum mean-square error approaches to speech enhancement

Speech communication 2019 anicolson/DeepXi

MMSE approaches utilising the proposed a priori SNR estimator are able to achieve higher enhanced speech quality and intelligibility scores than recent masking- and mapping-based deep learning approaches.

SPEECH ENHANCEMENT

Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks

31 Aug 2018santi-pdp/segan_pytorch

Most methods of voice restoration for patients suffering from aphonia either produce whispered or monotone speech.

SPEECH ENHANCEMENT

Language and Noise Transfer in Speech Enhancement Generative Adversarial Network

18 Dec 2017santi-pdp/segan_pytorch

In this work, we present the results of adapting a speech enhancement generative adversarial network by finetuning the generator with small amounts of data.

SPEECH ENHANCEMENT

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

5 Jan 2019cvdfoundation/ava-dataset

The dataset contains temporally labeled face tracks in video, where each face instance is labeled as speaking or not, and whether the speech is audible.

SPEAKER DIARIZATION SPEECH ENHANCEMENT

A Fully Convolutional Neural Network for Speech Enhancement

22 Sep 2016zhr1201/CNN-for-single-channel-speech-enhancement

In hearing aids, the presence of babble noise degrades hearing intelligibility of human speech greatly.

SPEECH ENHANCEMENT

Improved Speech Enhancement with the Wave-U-Net

27 Nov 2018craigmacartney/Wave-U-Net-For-Speech-Enhancement

We study the use of the Wave-U-Net architecture for speech enhancement, a model introduced by Stoller et al for the separation of music vocals and accompaniment.

SPEECH ENHANCEMENT SPEECH RECOGNITION

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Speech Quality and Testing Framework

23 Jan 2020microsoft/DNS-Challenge

In this challenge, we open-source a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.

SPEECH ENHANCEMENT

Contaminated speech training methods for robust DNN-HMM distant speech recognition

10 Oct 2017mravanelli/pySpeechRev

Despite the significant progress made in the last years, state-of-the-art speech recognition technologies provide a satisfactory performance only in the close-talking condition.

DISTANT SPEECH RECOGNITION SPEECH ENHANCEMENT