CoVoST is a large-scale multilingual speech-to-text translation corpus. Its latest 2nd version covers translations from 21 languages into English and from English into 15 languages. It has total 2880 hours of speech and is diversified with 78K speakers and 66 accents.

Source: CoVoST 2 and Massively Multilingual Speech-to-Text Translation

Papers


Paper Code Results Date Stars

Dataset Loaders


Tasks


Similar Datasets