Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning

11 Nov 2019Sathish IndurthiHoujeung HanNikhil Kumar LakumarapuBeomseok LeeInsoo ChungSangha KimChanwoo Kim

End-to-end Speech Translation (ST) models have several advantages such as lower latency, smaller model size, and less error compounding over conventional pipelines that combine Automatic Speech Recognition (ASR) and text Machine Translation (MT) models. However, collecting large amounts of parallel data for ST task is more difficult compared to the ASR and MT tasks... (read more)

PDF Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.