Search Results for author: Shuangyu Chang

Found 9 papers, 0 papers with code

External Language Model Integration for Factorized Neural Transducers

no code implementations • 26 May 2023 • Michael Levit, Sarangarajan Parthasarathy, Cem Aksoylar, Mohammad Sadegh Rasooli, Shuangyu Chang

We propose an adaptation method for factorized neural transducers (FNT) with external language models.

Language Modelling

Paper
Add Code

Streaming Punctuation: A Novel Punctuation Technique Leveraging Bidirectional Context for Continuous Speech Recognition

no code implementations • 10 Jan 2023 • Piyush Behre, Sharman Tan, Padma Varadharajan, Shuangyu Chang

While speech recognition Word Error Rate (WER) has reached human parity for English, continuous speech recognition scenarios such as voice typing and meeting transcriptions still suffer from segmentation and punctuation problems, resulting from irregular pausing patterns or slow speakers.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

no code implementations • 27 Oct 2022 • Piyush Behre, Sharman Tan, Amy Shah, Harini Kesavamoorthy, Shuangyu Chang, Fei Zuo, Chris Basoglu, Sayan Pathak

Punctuation and Segmentation are key to readability in Automatic Speech Recognition (ASR), often evaluated using F1 scores that require high-quality human transcripts and do not reflect readability well.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

no code implementations • 26 Oct 2022 • Piyush Behre, Naveen Parihar, Sharman Tan, Amy Shah, Eva Sharma, Geoffrey Liu, Shuangyu Chang, Hosam Khalil, Chris Basoglu, Sayan Pathak

For the downstream task of machine translation, it improves the translation BLEU score by an average of 1. 05 points.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition

no code implementations • 26 Oct 2022 • Sharman Tan, Piyush Behre, Nick Kibre, Issac Alphonso, Shuangyu Chang

Features such as punctuation, capitalization, and formatting of entities are important for readability, understanding, and natural language processing tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Streaming Punctuation for Long-form Dictation with Transformers

no code implementations • 11 Oct 2022 • Piyush Behre, Sharman Tan, Padma Varadharajan, Shuangyu Chang

While speech recognition Word Error Rate (WER) has reached human parity for English, long-form dictation scenarios still suffer from segmentation and punctuation problems resulting from irregular pausing patterns or slow speakers.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Multilingual Transformer Language Model for Speech Recognition in Low-resource Languages

no code implementations • 8 Sep 2022 • Li Miao, Jian Wu, Piyush Behre, Shuangyu Chang, Sarangarajan Parthasarathy

It is challenging to train and deploy Transformer LMs for hybrid speech recognition 2nd pass re-ranking in low-resource languages due to (1) data scarcity in low-resource languages, (2) expensive computing costs for training and refreshing 100+ monolingual models, and (3) hosting inefficiency considering sparse traffic.

Language Modelling Re-Ranking +2

Paper
Add Code

LSTM-LM with Long-Term History for First-Pass Decoding in Conversational Speech Recognition

no code implementations • 21 Oct 2020 • Xie Chen, Sarangarajan Parthasarathy, William Gale, Shuangyu Chang, Michael Zeng

The context information is captured by the hidden states of LSTM-LMs across utterance and can be used to guide the first-pass search effectively.

speech-recognition Speech Recognition

Paper
Add Code

Long-span language modeling for speech recognition

no code implementations • 11 Nov 2019 • Sarangarajan Parthasarathy, William Gale, Xie Chen, George Polovets, Shuangyu Chang

We conduct language modeling and speech recognition experiments on the publicly available LibriSpeech corpus.

Language Modelling Re-Ranking +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.