1 code implementation • 6 Apr 2021 • Kathleen Siminyu, Godson Kalipe, Davor Orlic, Jade Abbott, Vukosi Marivate, Sackey Freshia, Prateek Sibal, Bhanu Neupane, David I. Adelani, Amelia Taylor, Jamiil Toure Ali, Kevin Degila, Momboladji Balogoun, Thierno Ibrahima DIOP, Davis David, Chayma Fourati, Hatem Haddad, Malek Naski
Advances in speech and language technologies enable tools such as voice-search, text-to-speech, speech recognition and machine translation.
We investigate how and when this training condition affects the final quality and intelligibility of a translation.
Yor\`ub\'a is a widely spoken West African language with a writing system rich in orthographic and tonal diacritics.
In this paper we focus on two African languages, Yor\`ub\'a and Twi, and compare the word embeddings obtained in this way, with word embeddings obtained from curated corpora and a language-dependent processing.