Browse > Speech > Speech Recognition

Speech Recognition

176 papers with code ยท Speech

Speech recognition is the task of recognising speech within audio and converting it into text.

State-of-the-art leaderboards

Latest papers without code

Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews

19 Aug 2019

In automatic speech recognition, often little training data is available for specific challenging tasks, but training of state-of-the-art automatic speech recognition systems requires large amounts of annotated speech.

DATA AUGMENTATION ROBUST SPEECH RECOGNITION TRANSFER LEARNING

IMS-Speech: A Speech to Text Tool

13 Aug 2019

We present the IMS-Speech, a web based tool for German and English speech transcription aiming to facilitate research in various disciplines which require accesses to lexical information in spoken language materials.

SPEECH RECOGNITION

End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning

13 Aug 2019

This paper presents our latest investigation on end-to-end automatic speech recognition (ASR) for overlapped speech.

SPEECH RECOGNITION TRANSFER LEARNING

Personal VAD: Speaker-Conditioned Voice Activity Detection

12 Aug 2019

In this paper, we propose "personal VAD", a system to detect the voice activity of a target speaker at the frame level.

ACTION DETECTION SPEAKER RECOGNITION SPEAKER VERIFICATION SPEECH RECOGNITION

Space-time error estimates for deep neural network approximations for differential equations

11 Aug 2019

It is the subject of the main result of this article to provide space-time error estimates for DNN approximations of Euler approximations of certain perturbed differential equations.

IMAGE CLASSIFICATION SPEECH RECOGNITION

Unsupervised Stemming based Language Model for Telugu Broadcast News Transcription

10 Aug 2019

By using techniques like smoothing and interpolation of pre-processed data with supervised and unsupervised stemming, different issues in language model for Indian language: Telugu has been addressed.

LANGUAGE MODELLING SPEECH RECOGNITION

Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling

9 Aug 2019

Out-of-domain ASR systems can be applied to perform speaker adaptation with untranscribed training data of the target language, and to decode the training speech into frame-level labels for DNN training.

MULTI-TASK LEARNING SPEECH RECOGNITION

Challenging the Boundaries of Speech Recognition: The MALACH Corpus

9 Aug 2019

This paper proposes that the community place focus on the MALACH corpus to develop speech recognition systems that are more robust with respect to accents, disfluencies and emotional speech.

SPEECH RECOGNITION

Emotionless: Privacy-Preserving Speech Analysis for Voice Assistants

9 Aug 2019

The voice signal is a rich resource that discloses several possible states of a speaker, such as emotional state, confidence and stress levels, physical condition, age, gender, and personal traits.

EMOTION RECOGNITION SPEECH RECOGNITION VOICE CONVERSION

Mitigating Noisy Inputs for Question Answering

8 Aug 2019

We investigate and mitigate the effects of noise from Automatic Speech Recognition systems on two factoid Question Answering (QA) tasks.

MACHINE TRANSLATION OPTICAL CHARACTER RECOGNITION QUESTION ANSWERING SPEECH RECOGNITION