Search Results for author: Linhao Dong

Found 9 papers, 3 papers with code

CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training

no code implementations27 May 2023 Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma

We also observe the cross-modal representation extracted by CIF-PT obtains better performance than other neural interfaces for the tasks of SLU, including the dominant speech representation learned from self-supervised pre-training.

intent-classification Intent Classification +5

Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection

1 code implementation30 Jan 2022 Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu

Nowadays, most methods in end-to-end contextual speech recognition bias the recognition process towards contextual knowledge.

speech-recognition Speech Recognition

CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition

no code implementations17 Dec 2020 Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu

End-to-end (E2E) models have achieved promising results on multiple speech recognition benchmarks, and shown the potential to become the mainstream.

speech-recognition Speech Recognition

CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition

2 code implementations27 May 2019 Linhao Dong, Bo Xu

In this paper, we propose a novel soft and monotonic alignment mechanism used for sequence transduction.

Language Modelling Multi-Task Learning +2

Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping

no code implementations18 Feb 2019 Linhao Dong, Feng Wang, Bo Xu

Experiments on two Mandarin ASR datasets show the replacement of RNNs by the self-attention networks yields a 8. 4%-10. 2% relative character error rate (CER) reduction.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese

no code implementations16 May 2018 Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu

Experiments on HKUST datasets demonstrate that the lexicon free modeling units can outperform lexicon related modeling units in terms of character error rate (CER).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese

1 code implementation28 Apr 2018 Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu

Furthermore, we investigate a comparison between syllable based model and context-independent phoneme (CI-phoneme) based model with the Transformer in Mandarin Chinese.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Cannot find the paper you are looking for? You can Submit a new open access paper.