1 code implementation • 12 Jun 2023 • Andani Madodonga, Vukosi Marivate, Matthew Adendorff
Due to the shortage of data for these native South African languages, the datasets that were created were augmented and oversampled to increase data size and overcome class classification imbalance.
2 code implementations • 7 Mar 2023 • Richard Lastrucci, Isheanesu Dzingirai, Jenalea Rajab, Andani Madodonga, Matimba Shingange, Daniel Njini, Vukosi Marivate
This paper introduces two multilingual government themed corpora in various South African languages.