Search Results for author: Zhanheng Yang

Found 5 papers, 2 papers with code

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

no code implementations • 1 Jun 2023 • Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie

By incorporating additional contextual information, deep biasing methods have emerged as a promising solution for speech recognition of personalized words.

speech-recognition Speech Recognition

Paper
Add Code

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

no code implementations • 21 May 2023 • Kaixun Huang, Ao Zhang, Zhanheng Yang, Pengcheng Guo, Bingshen Mu, Tianyi Xu, Lei Xie

In this study, we introduce a contextual phrase prediction network for an attention-based deep bias method.

speech-recognition Speech Recognition

Paper
Add Code

Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer

no code implementations • 17 Jan 2023 • Zhanheng Yang, Sining Sun, Xiong Wang, Yike Zhang, Long Ma, Lei Xie

In this paper, we propose an efficient approach to obtain a high quality contextual list for a unified streaming/non-streaming based E2E model.

Paper
Add Code

The NPU System for the 2020 Personalized Voice Trigger Challenge

1 code implementation • 26 Feb 2021 • Jingyong Hou, Li Zhang, Yihui Fu, Qing Wang, Zhanheng Yang, Qijie Shao, Lei Xie

This paper describes the system developed by the NPU team for the 2020 personalized voice trigger challenge.

Small-Footprint Keyword Spotting Speaker Verification

10,084

Paper
Code

IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines

1 code implementation • 4 Nov 2020 • Yihui Fu, Zhuoyuan Yao, Weipeng He, Jian Wu, Xiong Wang, Zhanheng Yang, Shimin Zhang, Lei Xie, DongYan Huang, Hui Bu, Petr Motlicek, Jean-Marc Odobez

In this challenge, we open source a sizable speech, keyword, echo and noise corpus for promoting data-driven methods, particularly deep-learning approaches on KWS and SSL.

Sound Audio and Speech Processing

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.