no code implementations • 25 Mar 2024 • Luca Malagutti, Andrius Buinovskij, Anej Svete, Clara Meister, Afra Amini, Ryan Cotterell
For nearly three decades, language models derived from the $n$-gram assumption held the state of the art on the task.
Language Modelling Machine Translation