1 code implementation • 25 Oct 2022 • Sudhanshu Ranjan, Dheeraj Mekala, Jingbo Shang
Instead of training on the entire code-switched corpus at once, we create buckets based on the fraction of words in the resource-rich language and progressively train from samples dominated by the resource-rich language to samples dominated by the low-resource language.
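
The bucketing and ordering described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes each sample already carries per-token language tags, and the names `rich_fraction`, `make_buckets`, and `curriculum`, the equal-width bucket boundaries, the default of three buckets, and the `"en"` tag for the resource-rich language are all hypothetical choices made for illustration.

```python
from typing import Iterator, List, Sequence, Tuple

# A sample is (tokens, per-token language tags). The tags are assumed to be
# available already, e.g. from a token-level language identifier.
Sample = Tuple[List[str], List[str]]


def rich_fraction(tags: Sequence[str], rich_lang: str = "en") -> float:
    """Fraction of tokens tagged as the resource-rich language."""
    if not tags:
        return 0.0
    return sum(tag == rich_lang for tag in tags) / len(tags)


def make_buckets(
    samples: Sequence[Sample], num_buckets: int = 3, rich_lang: str = "en"
) -> List[List[Sample]]:
    """Group samples into equal-width buckets by resource-rich fraction.

    Bucket 0 holds the samples most dominated by the resource-rich language;
    the last bucket holds those most dominated by the low-resource language.
    Equal-width boundaries are an assumption of this sketch.
    """
    buckets: List[List[Sample]] = [[] for _ in range(num_buckets)]
    for sample in samples:
        frac = rich_fraction(sample[1], rich_lang)
        # Invert the fraction so that a higher resource-rich share maps to an
        # earlier bucket; clamp so that frac == 0.0 lands in the last bucket.
        idx = min(int((1.0 - frac) * num_buckets), num_buckets - 1)
        buckets[idx].append(sample)
    return buckets


def curriculum(
    samples: Sequence[Sample], num_buckets: int = 3, rich_lang: str = "en"
) -> Iterator[List[Sample]]:
    """Yield non-empty buckets in training order, resource-rich first."""
    for bucket in make_buckets(samples, num_buckets, rich_lang):
        if bucket:
            yield bucket
```

A training loop would consume `curriculum(samples)` one bucket at a time, for example fine-tuning the same model on each yielded bucket in turn. Whether earlier buckets are revisited in later stages is a design decision this sketch leaves open.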