speech-recognition

1005 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in speech-recognition

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Libraries

Use these libraries to find speech-recognition models and implementations

espnet/espnet

16 papers

7,912

msalhab96/SpeeQ

11 papers

pytorch/fairseq

10 papers

29,318

PaddlePaddle/PaddleSpeech

10 papers

10,186

See all 16 libraries.

Most implemented papers

Most implemented Social Latest No code

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

microsoft/unilm • • 26 Oct 2021

Self-supervised learning (SSL) achieves great success in speech recognition, while limited exploration has been attempted for other speech processing tasks.

Paper
Code

Data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

pytorch/fairseq • • Preprint 2022

While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind.

Paper
Code

An exact mapping between the Variational Renormalization Group and Deep Learning

rodsveiga/rbm_flows_ising • • 14 Oct 2014

Here, we show that deep learning is intimately related to one of the most important and successful techniques in theoretical physics, the renormalization group (RG).

Paper
Code

Neural NILM: Deep Neural Networks Applied to Energy Disaggregation

JackKelly/neuralnilm_prototype • • 23 Jul 2015

Energy disaggregation estimates appliance-by-appliance electricity consumption from a single meter that measures the whole home's electricity demand.

Paper
Code

EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding

yajiemiao/eesen • 29 Jul 2015

The performance of automatic speech recognition (ASR) has improved tremendously due to the application of deep neural networks (DNNs).

Paper
Code

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

tensorflow/tensorflow • • 14 Mar 2016

TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms.

Paper
Code

Geometric deep learning on graphs and manifolds using mixture model CNNs

dmlc/dgl • • CVPR 2017

Recently, there has been an increasing interest in geometric deep learning, attempting to generalize deep learning methods to non-Euclidean structured data such as graphs and manifolds, with a variety of applications from the domains of network analysis, computational social science, or computer graphics.

Paper
Code

Combining Residual Networks with LSTMs for Lipreading

tstafylakis/Lipreading-ResNet • • 12 Mar 2017

We propose an end-to-end deep learning architecture for word-level visual speech recognition.

Paper
Code

Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting

castorini/honk • • 18 Oct 2017

We describe Honk, an open-source PyTorch reimplementation of convolutional neural networks for keyword spotting that are included as examples in TensorFlow.

Paper
Code

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

sooftware/End-to-end-Speech-Recognition • • 5 Dec 2017

Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS), subsume the acoustic, pronunciation and language model components of a traditional automatic speech recognition (ASR) system into a single neural network.

Paper
Code

speech-recognition

Benchmarks Add a Result

Libraries

Most implemented papers

Content

Benchmarks

Add a Result