Search Results for author: I-Fan Chen

Found 8 papers, 0 papers with code

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

no code implementations • 26 Sep 2023 • Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastow, Ivan Bulyko

We propose a neural language modeling system based on low-rank adaptation (LoRA) for speech recognition output rescoring.

Language Modelling Large Language Model +2

Paper
Add Code

An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition

no code implementations • 11 Oct 2022 • Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee

We evaluate three end-to-end deep models, including LAS, hybrid CTC/attention, and RNN transducer, on the open-source LibriSpeech and TIMIT corpora.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities

no code implementations • 22 Jul 2022 • Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke

As for other forms of AI, speech recognition has recently been examined with respect to performance disparities across different user cohorts.

Fairness speech-recognition +1

Paper
Add Code

Investigation of Training Label Error Impact on RNN-T

no code implementations • 1 Dec 2021 • I-Fan Chen, Brian King, Jasha Droppo

In this paper, we propose an approach to quantitatively analyze impacts of different training label errors to RNN-T based ASR models.

Paper
Add Code

Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition

no code implementations • 30 Jun 2020 • Maarten Van Segbroeck, Harish Mallidih, Brian King, I-Fan Chen, Gurpreet Chadha, Roland Maas

Acoustic models in real-time speech recognition systems typically stack multiple unidirectional LSTM layers to process the acoustic frames over time.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Data Techniques For Online End-to-end Speech Recognition

no code implementations • 24 Jan 2020 • Yang Chen, Weiran Wang, I-Fan Chen, Chao Wang

Practitioners often need to build ASR systems for new use cases in a short amount of time, given limited in-domain data.

Data Augmentation Domain Adaptation +3

Paper
Add Code

End-to-end Anchored Speech Recognition

no code implementations • 6 Feb 2019 • Yiming Wang, Xing Fan, I-Fan Chen, Yuzong Liu, Tongfei Chen, Björn Hoffmeister

The anchored segment refers to the wake-up word part of an audio stream, which contains valuable speaker information that can be used to suppress interfering speech and background noise.

Multi-Task Learning speech-recognition +1

Paper
Add Code

Maximum a Posteriori Adaptation of Network Parameters in Deep Models

no code implementations • 6 Mar 2015 • Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jiadong Wu, Chin-Hui Lee

We present a Bayesian approach to adapting parameters of a well-trained context-dependent, deep-neural-network, hidden Markov model (CD-DNN-HMM) to improve automatic speech recognition performance.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.