Polyphone disambiguation

3 papers with code • 1 benchmarks • 1 datasets

A part of the TTS-front end framework which serves to predict the correct pronunciation for the input polyphone characters.

Datasets


Most implemented papers

g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

kakaobrain/g2pM 7 Apr 2020

Conversion of Chinese graphemes to phonemes (G2P) is an essential component in Mandarin Chinese Text-To-Speech (TTS) systems.

g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin

GitYCC/g2pW 20 Mar 2022

Polyphone disambiguation is the most crucial task in Mandarin grapheme-to-phoneme (g2p) conversion.

Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

zain-jiang/dict-tts 5 Jun 2022

This paper tackles the polyphone disambiguation problem from a concise and novel perspective: we propose Dict-TTS, a semantic-aware generative text-to-speech model with an online website dictionary (the existing prior information in the natural language).