The training set is translated by a strong machine translation system and the test set is translated by human.
How to learn a better speech representation for end-to-end speech-to-text translation (ST) with limited labeled data?
For offline speech translation, our best end-to-end model achieves 8. 1 BLEU improvements over the benchmark on the MuST-C test set and is even approaching the results of a strong cascade solution.
NeurST is an open-source toolkit for neural speech translation.
Ranked #1 on Speech-to-Text Translation on libri-trans
Can we build a system to fully utilize signals in a parallel ST corpus?
We propose the variational template machine (VTM), a novel method to generate text descriptions from data tables.