Polyphone disambiguation

3 papers with code • 1 benchmark • 1 dataset

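Polyphone disambiguation is a component of the text-to-speech (TTS) front end that predicts the correct pronunciation of each polyphonic character in the input text, that is, each character with more than one possible reading.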
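As a rough illustration of the task, the sketch below picks a pinyin reading for each character by matching known words first and falling back to a default reading. It is a toy baseline under stated assumptions, not the method of any paper listed here: the lexicon entries and the names `DEFAULT_READING`, `WORD_READINGS`, `MONOPHONE`, and `disambiguate` are hypothetical and chosen only for this example.

```python
# Toy polyphone disambiguation for Mandarin (illustrative only).
# All lexicon entries below are a tiny hypothetical sample, not a real dataset.

# Fallback reading for each polyphonic character when no word matches.
DEFAULT_READING = {"行": "xing2", "长": "chang2"}

# Word-level readings that override the character defaults.
WORD_READINGS = {
    "银行": ["yin2", "hang2"],
    "行长": ["hang2", "zhang3"],
    "长大": ["zhang3", "da4"],
}

# Characters with a single reading in this toy lexicon.
MONOPHONE = {"银": "yin2", "大": "da4", "走": "zou3", "很": "hen3"}


def disambiguate(text):
    """Return one pinyin syllable per character using longest-word matching."""
    max_len = max(len(word) for word in WORD_READINGS)
    readings = []
    i = 0
    while i < len(text):
        # Try the longest multi-character word starting at position i.
        for n in range(min(max_len, len(text) - i), 1, -1):
            word = text[i:i + n]
            if word in WORD_READINGS:
                readings.extend(WORD_READINGS[word])
                i += n
                break
        else:
            # No word matched: use the default or single reading,
            # and pass unknown characters through unchanged.
            ch = text[i]
            readings.append(DEFAULT_READING.get(ch) or MONOPHONE.get(ch, ch))
            i += 1
    return readings


if __name__ == "__main__":
    for sentence in ["银行", "行走", "银行行长", "很长"]:
        print(sentence, disambiguate(sentence))
```

The same character 行 comes out as `hang2` in 银行 and as `xing2` in 行走, which is the kind of context dependence that a trained disambiguation model is meant to learn from data rather than from hand-written word lists.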

Benchmarks

These leaderboards are used to track progress in polyphone disambiguation.

Dataset	Best Model
CPP	g2pW

Datasets

Most implemented papers

g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

kakaobrain/g2pM • 7 Apr 2020

Conversion of Chinese graphemes to phonemes (G2P) is an essential component in Mandarin Chinese Text-To-Speech (TTS) systems.

g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin

GitYCC/g2pW • 20 Mar 2022

Polyphone disambiguation is the most crucial task in Mandarin grapheme-to-phoneme (g2p) conversion.

Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

zain-jiang/dict-tts • 5 Jun 2022

This paper tackles the polyphone disambiguation problem from a concise and novel perspective: we propose Dict-TTS, a semantic-aware generative text-to-speech model that uses an online dictionary (existing prior information in natural language).
