no code implementations • 29 Nov 2022 • Christoph Minixhofer, Ondřej Klejch, Peter Bell
While modern Text-to-Speech (TTS) systems can produce natural-sounding speech, they remain unable to reproduce the full diversity found in natural speech data.
Automatic Speech Recognition (ASR)
no code implementations • 15 Dec 2021 • Christoph Minixhofer, Ondřej Klejch, Peter Bell
In this work, we unify several existing decoding strategies for punctuation prediction in one framework and introduce a novel strategy which utilises multiple predictions at each word across different windows.
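As a rough illustration of what combining multiple predictions at each word across different windows can look like, the sketch below averages per-word label probabilities over every overlapping window that contains the word and then picks the most likely label. This is an assumption made for illustration, not the strategy proposed in the paper: the label set `LABELS`, the helper names `windows` and `aggregate`, and the choice of simple averaging are all hypothetical.

```python
# Hypothetical label set: no punctuation, comma, full stop, question mark.
LABELS = ["", ",", ".", "?"]


def windows(n_words, size, stride):
    """Yield (start, end) word-index ranges that cover n_words words.

    Assumes 1 <= stride <= size, so consecutive windows overlap or touch.
    """
    start = 0
    while True:
        end = min(start + size, n_words)
        yield start, end
        if end == n_words:
            break
        start += stride


def aggregate(n_words, window_probs):
    """Average label distributions over all windows containing each word.

    window_probs is a list of (start, probs) pairs, where probs[i] is the
    label distribution a (hypothetical) model predicted for word start + i.
    Returns one label from LABELS per word.
    """
    sums = [[0.0] * len(LABELS) for _ in range(n_words)]
    counts = [0] * n_words
    for start, probs in window_probs:
        for offset, dist in enumerate(probs):
            idx = start + offset
            counts[idx] += 1
            for k, p in enumerate(dist):
                sums[idx][k] += p

    labels = []
    for idx in range(n_words):
        if counts[idx] == 0:
            raise ValueError(f"word {idx} is not covered by any window")
        mean = [s / counts[idx] for s in sums[idx]]
        best = max(range(len(LABELS)), key=mean.__getitem__)
        labels.append(LABELS[best])
    return labels
```

A word seen by only one window keeps that window's prediction, while a word in the overlap gets the averaged distribution, so a confident prediction from one window can override a weak one from another. Other combination rules (for example, trusting only the prediction nearest the window centre) would fit the same interface.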