no code implementations • 12 Sep 2023 • Juntae Kim, Minkyu Lim, Seokjin Hong
Inverse text normalization (ITN) is crucial for converting spoken-form into written-form, especially in the context of automatic speech recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 21 Jun 2019 • Minkyu Lim, Ji-Hwan Kim
By contrast, a general-purpose deep learning framework, such as TensorFlow, can easily build various types of neural network architectures using a tensor-based computation method, but it is difficult to apply them to WFST-based speech recognition.
no code implementations • 11 Jul 2018 • Hosung Park, Dong-Hyun Lee, Minkyu Lim, Yoseb Kang, Juneseok Oh, Ji-Hwan Kim
In this paper, a time delay neural network (TDNN) based acoustic model is proposed to implement a fast-converged acoustic modeling for Korean speech recognition.