12 Oct 2022 • Thibault Sellam, Ankur Bapna, Joshua Camp, Diana Mackinnon, Ankur P. Parikh, Jason Riesa
The main insight is that training one model on many locales consistently outperforms mono-locale baselines.
Diversity