Search Results for author: Bi-Cheng Yan

Found 12 papers, 1 papers with code

DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition

no code implementations26 Mar 2024 Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin, Berlin Chen

End-to-end automatic speech recognition (E2E ASR) systems often suffer from mistranscription of domain-specific phrases, such as named entities, sometimes leading to catastrophic failures in downstream tasks.

Automatic Speech Recognition Language Modelling +2

Preserving Phonemic Distinctions for Ordinal Regression: A Novel Loss Function for Automatic Pronunciation Assessment

no code implementations3 Oct 2023 Bi-Cheng Yan, Hsin-Wei Wang, Yi-Cheng Wang, Jiun-Ting Li, Chi-Han Lin, Berlin Chen

Automatic pronunciation assessment (APA) manages to quantify the pronunciation proficiency of a second language (L2) learner in a language.

regression

Effective Cross-Utterance Language Modeling for Conversational Speech Recognition

no code implementations5 Nov 2021 Bi-Cheng Yan, Hsin-Wei Wang, Shih-Hsuan Chiu, Hsuan-Sheng Chiu, Berlin Chen

Conversational speech normally is embodied with loose syntactic structures at the utterance level but simultaneously exhibits topical coherence relations across consecutive utterances.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis

no code implementations1 Nov 2021 Hsin-Wei Wang, Bi-Cheng Yan, Hsuan-Sheng Chiu, Yung-Chang Hsu, Berlin Chen

In addition, we design and develop a pronunciation modeling network stacked on top of the NAR E2E models of our method to further boost the effectiveness of MD&D.

Maximum F1-score training for end-to-end mispronunciation detection and diagnosis of L2 English speech

no code implementations31 Aug 2021 Bi-Cheng Yan, Shao-Wei Fan Jiang, Fu-An Chao, Berlin Chen

End-to-end (E2E) neural models are increasingly attracting attention as a promising modeling approach for mispronunciation detection and diagnosis (MDD).

Data Augmentation

TENET: A Time-reversal Enhancement Network for Noise-robust ASR

1 code implementation4 Jul 2021 Fu-An Chao, Shao-Wei Fan Jiang, Bi-Cheng Yan, Jeih-weih Hung, Berlin Chen

Due to the unprecedented breakthroughs brought about by deep learning, speech enhancement (SE) techniques have been developed rapidly and play an important role prior to acoustic modeling to mitigate noise effects on speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

End-to-End Mispronunciation Detection and Diagnosis From Raw Waveforms

no code implementations4 Mar 2021 Bi-Cheng Yan, Berlin Chen

Furthermore, our model can achieve comparable mispronunciation detection performance in relation to state-of-the-art E2E MDD models that take input the standard handcrafted acoustic features.

Cannot find the paper you are looking for? You can Submit a new open access paper.