Automatic Speech Translation

6 papers with code • 2 benchmarks • 2 datasets

Most implemented papers

End-to-End Automatic Speech Translation of Audiobooks

alicank/Translation-Augmented-LibriSpeech-Corpus 12 Feb 2018

We investigate end-to-end speech-to-text translation on a corpus of audiobooks specifically augmented for this task.

SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

ze1gades/diary 27 Feb 2020

Our method compares favorably to SpecAugment on English→French and English→Romanian automatic speech translation (AST) tasks as well as on a low-resource English automatic speech recognition (ASR) task.

Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts

hubreb/imitkd_ast 17 Jul 2023

We present an imitation learning approach where a teacher NMT system corrects the errors of an AST student without relying on manual transcripts.

Seamless: Multilingual Expressive and Streaming Speech Translation

facebookresearch/seamless_communication 8 Dec 2023

In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion.

MooER: LLM-based Speech Recognition and Translation Models from Moore Threads

moorethreads/mooer 9 Aug 2024

We achieve performance comparable to other open-source models trained with up to hundreds of thousands of hours of labeled speech data.

BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages

ai4bharat/bhasaanuvaad 7 Nov 2024

To this end, we introduce BhasaAnuvaad, the largest publicly available dataset for AST involving 13 out of 22 scheduled Indian languages and English spanning over 44,400 hours and 17M text segments.