ArzEn (Corpus of Egyptian Arabic-English Code-switching)

Introduced by Hamed et al. in Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition

The Corpus of Egyptian Arabic-English Code-switching (ArzEn) is a spontaneous conversational speech corpus collected through informal interviews held at the German University in Cairo. Participants discussed broad topics, including education, hobbies, work, and life experiences. The corpus currently contains 12 hours of speech comprising 6,216 utterances. The recordings were transcribed and translated into monolingual Egyptian Arabic and monolingual English.

Source: https://arxiv.org/pdf/2211.16319v1.pdf


Dataset Loaders


No data loaders found.

License


  • Unknown
