Search Results for author: Keqi Deng

Found 11 papers, 1 paper with code

FastInject: Injecting Unpaired Text Data into CTC-based ASR training

no code implementations • 14 Dec 2023 • Keqi Deng, Philip C. Woodland

Recently, connectionist temporal classification (CTC)-based end-to-end (E2E) automatic speech recognition (ASR) models have achieved impressive results, especially with the development of self-supervised learning.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +2

Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition

no code implementations • 19 Nov 2023 • Keqi Deng, Philip C. Woodland

An Auto-regressive Integrate-and-Fire (AIF) mechanism is proposed to generate the label-level encoder representation while retaining low-latency operation that can be used for streaming.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Decoupled Structure for Improved Adaptability of End-to-End Models

no code implementations • 25 Aug 2023 • Keqi Deng, Philip C. Woodland

Although end-to-end (E2E) trainable automatic speech recognition (ASR) has shown great success by jointly learning acoustic and linguistic information, it still suffers from the effect of domain shifts, thus limiting potential applications.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Label-Synchronous Neural Transducer for End-to-End ASR

no code implementations • 6 Jul 2023 • Keqi Deng, Philip C. Woodland

Hence blank tokens are no longer needed and the prediction network can be easily adapted using text data.

Domain Adaptation

Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax

no code implementations • 16 Feb 2023 • Keqi Deng, Philip C. Woodland

End-to-end (E2E) automatic speech recognition (ASR) implicitly learns the token sequence distribution of paired audio-transcript training data.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation

no code implementations • 19 Apr 2022 • Keqi Deng, Shinji Watanabe, Jiatong Shi, Siddhant Arora

Although Transformers have gained success in several speech processing tasks like spoken language understanding (SLU) and speech translation (ST), achieving online processing while keeping competitive performance is still essential for real-world interaction.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

1 code implementation • 22 Feb 2022 • Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

Recently, end-to-end automatic speech recognition models based on connectionist temporal classification (CTC) have achieved impressive results, especially when fine-tuned from wav2vec2.0 models.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning

no code implementations • 15 Sep 2021 • Keqi Deng, Songjun Cao, Long Ma

For the former task, a standard deviation constraint loss (SDC-loss) based end-to-end (E2E) architecture is proposed to identify accents under the same language.

Accented Speech Recognition, Automatic Speech Recognition, +3
