Search Results for author: Hongyu Xiang

Found 6 papers, 4 papers with code

CPPF: A contextual and post-processing-free model for automatic speech recognition

no code implementations • 14 Sep 2023 • Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

To address this issue, we draw inspiration from the multifaceted capabilities of LLMs and Whisper, and focus on integrating multiple ASR text processing tasks related to speech recognition into the ASR model.

Automatic Speech Recognition speech-recognition +1

Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First Regularization

no code implementations • 7 Nov 2022 • Zhengkun Tian, Hongyu Xiang, Min Li, Feifei Lin, Ke Ding, Guanglu Wan

To reduce the peak latency, we propose a simple and novel method named peak-first regularization, which utilizes a frame-wise knowledge distillation function to force the probability distribution of the CTC model to shift left along the time axis instead of directly modifying the calculation process of CTC loss and gradients.

Knowledge Distillation

CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR

1 code implementation • 31 Mar 2022 • Keyu An, Huahuan Zheng, Zhijian Ou, Hongyu Xiang, Ke Ding, Guanglu Wan

The simulation module is jointly trained with the ASR model using a self-supervised loss; the ASR model is optimized with the usual ASR loss, e.g., CTC-CRF as used in our experiments.

Chunking speech-recognition +1

CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency

1 code implementation • 27 May 2020 • Keyu An, Hongyu Xiang, Zhijian Ou

In this paper, we present a new open source toolkit for speech recognition, named CAT (CTC-CRF based ASR Toolkit).

Ranked #1 on Speech Recognition on Hub5'00 FISHER-SWBD

speech-recognition Speech Recognition

CAT: CRF-based ASR Toolkit

2 code implementations • 20 Nov 2019 • Keyu An, Hongyu Xiang, Zhijian Ou

In this paper, we present a new open source toolkit for automatic speech recognition (ASR), named CAT (CRF-based ASR Toolkit).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

CRF-based Single-stage Acoustic Modeling with CTC Topology

1 code implementation • 16 Apr 2019 • Hongyu Xiang, Zhijian Ou

CTC-CRF is conceptually simple, which basically implements a CRF layer on top of features generated by the bottom neural network with the special state topology.

Ranked #2 on Speech Recognition on WSJ eval93

Benchmarking Speech Recognition
