no code implementations • 1 Jun 2023 • Yuting Yang, Yuke Li, Binbin Du
Specifically, the top-layer hidden representation at the same frame of the streaming and non-streaming modes are regarded as a positive pair, encouraging the representation of the streaming mode close to its non-streaming counterpart.
no code implementations • 13 Mar 2023 • Binbin Du, Rui Deng, Yingxin Zhang
In task3, we employ the ASR system to improve the visual system, some false subtitles can be corrected by a fusion module.
no code implementations • 25 May 2022 • Yuting Yang, Yuke Li, Binbin Du
The CTC-based automatic speech recognition (ASR) models without the external language model usually lack the capacity to model conditional dependencies and textual interactions.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 24 May 2022 • Yuting Yang, Binbin Du, Yuke Li
Thus only considering the writing of Chinese characters as modeling units is insufficient to capture speech features.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 3 Dec 2021 • Yuting Yang, Binbin Du, Yingxin Zhang, Wenxuan Wang, Yuke Li
We propose a mandarin keyword spotting system (KWS) with several novel and effective improvements, including a big backbone (B) model, a keyword biasing (B) mechanism and the introduction of syllable modeling units (S).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2