Phone-level pronunciation scoring
5 papers with code • 1 benchmarks • 1 datasets
Most implemented papers
speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment
This paper introduces a new open-source speech corpus named "speechocean762" designed for pronunciation assessment use, consisting of 5000 English utterances from 250 non-native speakers, where half of the speakers are children.
A transfer learning based approach for pronunciation scoring
Phone-level pronunciation scoring is a challenging task, with performance far from that of human annotators.
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Automatic pronunciation assessment is an important technology to help self-directed language learners.
Hierarchical Pronunciation Assessment with Multi-Aspect Attention
In this paper, we propose a Hierarchical Pronunciation Assessment with Multi-aspect Attention (HiPAMA) model, which hierarchically represents the granularity levels to directly capture their linguistic structures and introduces multi-aspect attention that reflects associations across aspects at the same level to create more connotative representations.
Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
In the first step, the pre-trained SSL model is fine-tuned on a phoneme recognition task to obtain better representations for the pronounced phonemes.