Search Results for author: Cihan Xiao

Found 2 papers, 1 papers with code

HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation

1 code implementation20 Jun 2023 Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur

We introduce HK-LegiCoST, a new three-way parallel corpus of Cantonese-English translations, containing 600+ hours of Cantonese audio, its standard traditional Chinese transcript, and English translation, segmented and aligned at the sentence level.

Cross-corpus Sentence +3

Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning

no code implementations31 May 2023 Shuyue Stella Li, Cihan Xiao, Tianjian Li, Bismarck Odoom

Our methods include a stacked Residual CNN+GRU model and a multitask pre-training approach to use Automatic Speech Recognition (ASR) as an auxiliary task for CSLID.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Cannot find the paper you are looking for? You can Submit a new open access paper.