Browse > Speech > Speech Recognition

Speech Recognition

176 papers with code · Speech

Speech recognition is the task of recognising speech within audio and converting it into text.

State-of-the-art leaderboards

Latest papers with code

MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible

30 Jul 2019getalp/mass-dataset

Therefore, this article proposes to add multilingual links between speech segments in different languages, and shares a large and clean dataset of 8, 130 para-lel spoken utterances across 8 languages (56 language pairs). We name this corpus MaSS (Multilingual corpus of Sentence-aligned Spoken utterances).

SPEECH RECOGNITION

4
30 Jul 2019

RadioTalk: a large-scale corpus of talk radio transcripts

16 Jul 2019social-machines/RadioTalk

We introduce RadioTalk, a corpus of speech recognition transcripts sampled from talk radio broadcasts in the United States between October of 2018 and March of 2019.

SPEECH RECOGNITION

33
16 Jul 2019

PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch

12 Jul 2019jzlianglu/pykaldi2

While similar toolkits are available built on top of the two, a key feature of PyKaldi2 is sequence training with criteria such as MMI, sMBR and MPE.

SPEECH RECOGNITION

71
12 Jul 2019

Attention model for articulatory features detection

2 Jul 2019sciforce/phones-las

Articulatory distinctive features, as well as phonetic transcription, play important role in speech-related tasks: computer-assisted pronunciation training, text-to-speech conversion (TTS), studying speech production mechanisms, speech recognition for low-resourced languages.

MANNER OF ARTICULATION DETECTION

4
02 Jul 2019

Enabling Real-time Neural IME with Incremental Vocabulary Selection

NAACL 2019 jiali-ms/JLM

Input method editor (IME) converts sequential alphabet key inputs to words in a target language.

LANGUAGE MODELLING SPEECH RECOGNITION

80
01 Jun 2019

RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation

8 May 2019rwth-i6/returnn

To the best knowledge of the authors, the results obtained when training on the full LibriSpeech training set, are the best published currently, both for the hybrid DNN/HMM and the attention-based systems.

END-TO-END SPEECH RECOGNITION LANGUAGE MODELLING SPEECH RECOGNITION

196
08 May 2019

Very Deep Self-Attention Networks for End-to-End Speech Recognition

30 Apr 2019quanpn90/NMTGMinor

Recently, end-to-end sequence-to-sequence models for speech recognition have gained significant interest in the research community.

END-TO-END SPEECH RECOGNITION SPEECH RECOGNITION

16
30 Apr 2019

Transformers with convolutional context for ASR

26 Apr 2019insop/pytorch-hackathon

The recent success of transformer networks for neural machine translation and other NLP tasks has led to a surge in research work trying to apply it for speech recognition.

MACHINE TRANSLATION SPEECH RECOGNITION

4
26 Apr 2019

Realizing Petabyte Scale Acoustic Modeling

24 Apr 2019OpenSourceAI/sota_server

We present the design and evaluation of a highly scalable and resource efficient SSL system for AM.

SPEECH RECOGNITION

1
24 Apr 2019