no code implementations • 29 Sep 2023 • Alexandra Antonova
We present a first large-scale public synthetic dataset for contextual spellchecking customization of automatic speech recognition (ASR) with focus on diverse rare and out-of-vocabulary (OOV) phrases, such as proper names or terms.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 4 Jun 2023 • Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg
Contextual spelling correction models are an alternative to shallow fusion to improve automatic speech recognition (ASR) quality given user vocabulary.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 29 Jul 2022 • Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg
The model is trained on the Google Text Normalization dataset and achieves state-of-the-art sentence accuracy on both English and Russian test sets.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4