Phoneme Recognition

27 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Phoneme Recognition models and implementations

Datasets


Most implemented papers

WaveNet: A Generative Model for Raw Audio

ibab/tensorflow-wavenet 12 Sep 2016

This paper introduces WaveNet, a deep neural network for generating raw audio waveforms.

Attention-Based Models for Speech Recognition

Alexander-H-Liu/End-to-end-ASR-Pytorch NeurIPS 2015

Recurrent sequence generators conditioned on input data through an attention mechanism have recently shown very good performance on a range of tasks in- cluding machine translation, handwriting synthesis and image caption gen- eration.

Sequence Transduction with Recurrent Neural Networks

TensorSpeech/TensorFlowASR 14 Nov 2012

One of the key challenges in sequence transduction is learning to represent both the input and output sequences in a way that is invariant to sequential distortions such as shrinking, stretching and translating.

Speech Recognition with Deep Recurrent Neural Networks

HawkAaron/warp-transducer 22 Mar 2013

Recurrent neural networks (RNNs) are a powerful model for sequential data.

Do Deep Nets Really Need to be Deep?

jchen98/compression NeurIPS 2014

Currently, deep neural networks are the state of the art on problems such as speech recognition and computer vision.

Simple and Effective Zero-shot Cross-lingual Phoneme Recognition

facebookresearch/fairseq 23 Sep 2021

Recent progress in self-training, self-supervised pretraining and unsupervised learning enabled well performing speech recognition systems without any labeled data.

Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment

mu-y/mpl-mdd 29 Mar 2022

We show that fine-tuning with pseudo labels achieves a 5. 35% phoneme error rate reduction and 2. 48% MDD F1 score improvement over a labeled-samples-only fine-tuning baseline.

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks

iSkaCh/PhonemeRecog-Without-MFCC- 7 Dec 2013

Most phoneme recognition state-of-the-art systems rely on a classical neural network classifiers, fed with highly tuned features, such as MFCC or PLP features.

Regularizing RNNs by Stabilizing Activations

vimarshc/fastai_experiments 26 Nov 2015

We stabilize the activations of Recurrent Neural Networks (RNNs) by penalizing the squared distance between successive hidden states' norms.

Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks

sdrobert/more-or-let 10 Jan 2017

Meanwhile, Connectionist Temporal Classification (CTC) with Recurrent Neural Networks (RNNs), which is proposed for labeling unsegmented sequences, makes it feasible to train an end-to-end speech recognition system instead of hybrid settings.