CoVoST is a large-scale multilingual speech-to-text translation corpus. Its latest version, CoVoST 2, covers translations from 21 languages into English and from English into 15 languages. It contains a total of 2,880 hours of speech and is diverse, with 78K speakers and 66 accents.

Source: CoVoST 2 and Massively Multilingual Speech-to-Text Translation


