First, we study the effectiveness of different dereverberation networks (the generator in the GAN) and find that the LSTM yields a significant improvement over the feed-forward DNN and the CNN on our dataset.
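As a rough illustration, the sketch below shows one way an LSTM-based generator for dereverberation might be organized; the layer sizes, feature dimension, and module names are illustrative assumptions rather than the configuration used in this work.

```python
# A minimal sketch (assumed configuration, not the paper's exact model) of an
# LSTM generator that maps reverberant log-spectral frames to enhanced frames.
import torch
import torch.nn as nn

class LSTMGenerator(nn.Module):
    def __init__(self, feat_dim=257, hidden_dim=512, num_layers=2):
        super().__init__()
        self.lstm = nn.LSTM(feat_dim, hidden_dim, num_layers, batch_first=True)
        self.proj = nn.Linear(hidden_dim, feat_dim)  # frame-wise output projection

    def forward(self, reverberant):         # (batch, frames, feat_dim)
        hidden, _ = self.lstm(reverberant)  # temporal modeling across frames
        return self.proj(hidden)            # enhanced (dereverberated) features

# Usage example: enhance a batch of 4 utterances, 100 frames each.
generator = LSTMGenerator()
enhanced = generator(torch.randn(4, 100, 257))  # -> (4, 100, 257)
```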
Deep generative models have achieved great success in unsupervised learning with the ability to capture complex nonlinear relationships between latent generating factors and observations.
The latent variables allow us to convert speech from one domain to another according to its context and domain representations.
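A hedged sketch of this idea follows: content and domain factors are encoded separately, and domain conversion is performed by recombining the source content with a target-domain representation. The encoder/decoder modules and the assumption of disentangled factors are hypothetical stand-ins, not the model described here.

```python
# Illustrative (assumed) latent recombination for domain conversion of speech features.
import torch
import torch.nn as nn

feat_dim, content_dim, domain_dim = 257, 64, 16

content_encoder = nn.Linear(feat_dim, content_dim)  # context / content factor
domain_encoder = nn.Linear(feat_dim, domain_dim)    # domain factor
decoder = nn.Linear(content_dim + domain_dim, feat_dim)

source = torch.randn(8, feat_dim)  # frames from the source domain
target = torch.randn(8, feat_dim)  # frames from the target domain

# Keep the source content, swap in the target-domain representation,
# and decode to obtain domain-converted features.
converted = decoder(torch.cat([content_encoder(source),
                               domain_encoder(target)], dim=-1))
```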
On the Aurora 4 task, the very deep CNN achieves a WER of 8.81%, which is further reduced to 7.99% with auxiliary feature joint training and to 7.09% with LSTM-RNN joint decoding.