no code implementations • 2 Apr 2025 • Zhiyuan Tang, Dong Wang, Zhikai Zhou, Yong liu, Shen Huang, Shidong Shang
Full-text error correction with Large Language Models (LLMs) for Automatic Speech Recognition (ASR) has gained increased attention due to its potential to correct errors across long contexts and address a broader spectrum of error types, including punctuation restoration and inverse text normalization.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
no code implementations • 12 Sep 2024 • Zhiyuan Tang, Dong Wang, Shen Huang, Shidong Shang
This paper investigates the effectiveness of LLMs for error correction in full-text generated by ASR systems from longer speech recordings, such as transcripts from podcasts, news broadcasts, and meetings.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+10
1 code implementation • 2 Jul 2024 • Zhiyuan Tang, Dong Wang, Shen Huang, Shidong Shang
Firstly, we construct a specialized benchmark dataset aimed at error correction for Chinese ASR with 724K hypotheses-transcription pairs, named the Chinese Hypotheses Paradise dataset (ChineseHP), which contains a wide range of scenarios and presents significant challenges.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 14 Mar 2023 • Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang
This paper introduces the Unbeatable Team's submission to the ICASSP 2023 Deep Noise Suppression (DNS) Challenge.
1 code implementation • 2 Apr 2021 • Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang
The ConferencingSpeech 2021 challenge is proposed to stimulate research on far-field multi-channel speech enhancement for video conferencing.