no code implementations • 17 Oct 2023 • Vishal Sunder, Beulah Karrolla, Eric Fosler-Lussier
To train this pointer network, we generate ground truth training signals by using forced alignment between the read speech and the text being read on the training set.