Speech Recognition

211 papers with code · Speech

Speech recognition is the task of recognising speech within audio and converting it into text.

Latest papers without code

TLT-school: a Corpus of Non Native Children Speech

22 Jan 2020

This paper describes "TLT-school", a corpus of speech utterances collected in schools of northern Italy for assessing the performance of students learning both English and German.

SPEECH RECOGNITION

Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300

20 Jan 2020

It is generally believed that direct sequence-to-sequence (seq2seq) speech recognition models are competitive with hybrid models only when a large amount of data, at least a thousand hours, is available for training.

DATA AUGMENTATION LANGUAGE MODELLING SPEECH RECOGNITION

FlexiBO: Cost-Aware Multi-Objective Optimization of Deep Neural Networks

18 Jan 2020

This is particularly important for optimizing DNNs: the cost arising on account of assessing the accuracy of DNNs is orders of magnitude higher than that of measuring the energy consumption of pre-trained DNNs.

OBJECT DETECTION SPEECH RECOGNITION

Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture

15 Jan 2020

To support online recognition, we integrate the state-reuse chunk-SAE and the MTA-based SAD into the online CTC/attention architecture.

END-TO-END SPEECH RECOGNITION SPEECH RECOGNITION

Visually Guided Self Supervised Learning of Speech Representations

13 Jan 2020

Self supervised representation learning has recently attracted a lot of research interest for both the audio and visual modalities.

EMOTION RECOGNITION REPRESENTATION LEARNING SPEECH RECOGNITION

Improving Spoken Language Understanding By Exploiting ASR N-best Hypotheses

11 Jan 2020

The NLU module usually uses the first best interpretation of a given speech in downstream tasks such as domain and intent classification.

INTENT CLASSIFICATION SPEECH RECOGNITION SPOKEN LANGUAGE UNDERSTANDING

Improving Dysarthric Speech Intelligibility Using Cycle-consistent Adversarial Training

10 Jan 2020

Dysarthria is a motor speech impairment affecting millions of people.

SPEECH RECOGNITION

Open Challenge for Correcting Errors of Speech Recognition Systems

9 Jan 2020

The paper announces the new long-term challenge for improving the performance of automatic speech recognition systems.

SPEECH RECOGNITION

Streaming automatic speech recognition with the transformer model

8 Jan 2020

Encoder-decoder based sequence-to-sequence models have demonstrated state-of-the-art results in end-to-end automatic speech recognition (ASR).

END-TO-END SPEECH RECOGNITION SPEECH RECOGNITION

Audio-visual Recognition of Overlapped speech for the LRS2 dataset

6 Jan 2020

Experiments on overlapped speech simulated from the LRS2 dataset suggest the proposed AVSR system outperformed the audio only baseline LF-MMI DNN system by up to 29.98% absolute in word error rate (WER) reduction, and produced recognition performance comparable to a more complex pipelined system.

AUDIO-VISUAL SPEECH RECOGNITION SPEECH SEPARATION VISUAL SPEECH RECOGNITION
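
Several entries above report results as word error rate (WER), and the last one quotes an "absolute" reduction. As a rough illustration of what those terms usually mean, the sketch below computes WER as word-level edit distance divided by reference length, and contrasts absolute with relative reduction. The function names and the example numbers are hypothetical, chosen only for illustration; they are not taken from any of the listed papers, and real toolkits typically add text normalisation that is omitted here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def absolute_reduction(baseline_wer: float, system_wer: float) -> float:
    """Difference in WER, expressed in the same units as the inputs."""
    return baseline_wer - system_wer


def relative_reduction(baseline_wer: float, system_wer: float) -> float:
    """Difference in WER as a fraction of the baseline WER."""
    return (baseline_wer - system_wer) / baseline_wer


if __name__ == "__main__":
    # Illustrative values only (hypothetical, not from any listed paper).
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
    print(absolute_reduction(0.50, 0.20))
    print(relative_reduction(0.50, 0.20))
```

The distinction matters when comparing papers: the same improvement looks much larger when reported as a relative reduction than as an absolute one, so it is worth checking which convention an abstract uses.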