Polyphone disambiguation
3 papers with code • 1 benchmark • 1 dataset
Polyphone disambiguation is the part of a text-to-speech (TTS) front end that predicts the correct pronunciation of each polyphonic character in the input text, that is, a character with more than one possible reading.
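To make the task concrete, here is a minimal sketch of the input and output involved: given a sentence and the position of a polyphonic character, pick one reading from that character's candidate list. The tables `CANDIDATES` and `CONTEXT_RULES` and the function `disambiguate` are hypothetical names invented for this illustration; they are not taken from any of the packages listed on this page, and real systems would typically learn this choice from context rather than rely on hand-written rules.

```python
# Toy lookup tables (illustrative assumptions, not from any listed package).
# Each polyphonic character maps to its candidate pinyin readings;
# the first entry is treated as the default reading.
CANDIDATES = {
    "行": ["xing2", "hang2"],
    "乐": ["le4", "yue4"],
}

# Words whose presence selects a specific reading for a character they contain.
CONTEXT_RULES = {
    "银行": {"行": "hang2"},  # "bank"
    "音乐": {"乐": "yue4"},   # "music"
}


def disambiguate(sentence: str, index: int) -> str:
    """Return a pinyin reading for the polyphonic character at sentence[index]."""
    char = sentence[index]
    readings = CANDIDATES[char]
    for word, mapping in CONTEXT_RULES.items():
        if char not in mapping:
            continue
        # Look at every occurrence of `word` and check whether it covers `index`.
        start = sentence.find(word)
        while start != -1:
            if start <= index < start + len(word) and word[index - start] == char:
                return mapping[char]
            start = sentence.find(word, start + 1)
    # No context rule applied: fall back to the default (first-listed) reading.
    return readings[0]


print(disambiguate("我去银行", 3))  # 行 inside 银行
print(disambiguate("他行走", 1))    # 行 with no matching rule
```

The same character receives different readings depending on the surrounding word, which is exactly the decision a polyphone disambiguation model has to make.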
Most implemented papers
g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Conversion of Chinese graphemes to phonemes (G2P) is an essential component in Mandarin Chinese Text-To-Speech (TTS) systems.
g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin
Polyphone disambiguation is the most crucial task in Mandarin grapheme-to-phoneme (g2p) conversion.
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
This paper tackles the polyphone disambiguation problem from a concise and novel perspective: we propose Dict-TTS, a semantic-aware generative text-to-speech model with an online website dictionary (the existing prior information in the natural language).