Search Results for author: Kaixun Huang

Found 6 papers, 0 papers with code

Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

no code implementations20 Aug 2024 Tianyi Xu, Kaixun Huang, Pengcheng Guo, Yu Zhou, Longtao Huang, Hui Xue, Lei Xie

Pre-trained multilingual speech foundation models, like Whisper, have shown impressive performance across different languages.

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

no code implementations15 Dec 2023 Ao Zhang, Pan Zhou, Kaixun Huang, Yong Zou, Ming Liu, Lei Xie

Open-vocabulary keyword spotting (KWS), which allows users to customize keywords, has attracted increasingly more interest.

Decoder Keyword Spotting

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

no code implementations7 Oct 2023 Kaixun Huang, Ao Zhang, BinBin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

However, unlike shallow fusion methods that directly bias the posterior of the ASR model, deep biasing methods implicitly integrate contextual information, making it challenging to control the degree of bias.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

no code implementations1 Jun 2023 Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie

By incorporating additional contextual information, deep biasing methods have emerged as a promising solution for speech recognition of personalized words.

speech-recognition Speech Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.