no code implementations • IWSLT (ACL) 2022 • Jinyi Yang, Amir Hussein, Matthew Wiesner, Sanjeev Khudanpur
This paper details the Johns Hopkins speech translation (ST) system used in the IWLST2022 dialect speech translation task.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • 20 Jun 2023 • Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur
We introduce HK-LegiCoST, a new three-way parallel corpus of Cantonese-English translations, containing 600+ hours of Cantonese audio, its standard traditional Chinese transcript, and English translation, segmented and aligned at the sentence level.
no code implementations • 8 Apr 2019 • Xiaofei Wang, Jinyi Yang, Ruizhi Li, Samik Sadhu, Hynek Hermansky
Quality of data plays an important role in most deep learning tasks.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 5 Feb 2017 • Chunxi Liu, Jinyi Yang, Ming Sun, Santosh Kesiraju, Alena Rott, Lucas Ondel, Pegah Ghahremani, Najim Dehak, Lukas Burget, Sanjeev Khudanpur
Acoustic unit discovery (AUD) is a process of automatically identifying a categorical acoustic unit inventory from speech and producing corresponding acoustic unit tokenizations.