speech-recognition

1005 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find speech-recognition models and implementations
16 papers
7,912
11 papers
45
10 papers
29,318
See all 16 libraries.

Most implemented papers

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

microsoft/unilm 26 Oct 2021

Self-supervised learning (SSL) achieves great success in speech recognition, while limited exploration has been attempted for other speech processing tasks.

Data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

pytorch/fairseq Preprint 2022

While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind.

An exact mapping between the Variational Renormalization Group and Deep Learning

rodsveiga/rbm_flows_ising 14 Oct 2014

Here, we show that deep learning is intimately related to one of the most important and successful techniques in theoretical physics, the renormalization group (RG).

Neural NILM: Deep Neural Networks Applied to Energy Disaggregation

JackKelly/neuralnilm_prototype 23 Jul 2015

Energy disaggregation estimates appliance-by-appliance electricity consumption from a single meter that measures the whole home's electricity demand.

EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding

yajiemiao/eesen 29 Jul 2015

The performance of automatic speech recognition (ASR) has improved tremendously due to the application of deep neural networks (DNNs).

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

tensorflow/tensorflow 14 Mar 2016

TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms.

Geometric deep learning on graphs and manifolds using mixture model CNNs

dmlc/dgl CVPR 2017

Recently, there has been an increasing interest in geometric deep learning, attempting to generalize deep learning methods to non-Euclidean structured data such as graphs and manifolds, with a variety of applications from the domains of network analysis, computational social science, or computer graphics.

Combining Residual Networks with LSTMs for Lipreading

tstafylakis/Lipreading-ResNet 12 Mar 2017

We propose an end-to-end deep learning architecture for word-level visual speech recognition.

Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting

castorini/honk 18 Oct 2017

We describe Honk, an open-source PyTorch reimplementation of convolutional neural networks for keyword spotting that are included as examples in TensorFlow.

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

sooftware/End-to-end-Speech-Recognition 5 Dec 2017

Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS), subsume the acoustic, pronunciation and language model components of a traditional automatic speech recognition (ASR) system into a single neural network.