LREC 2014 • Christian Buck, Kenneth Heafield, Bas van Ooyen
We contribute 5-gram counts and language models trained on the Common Crawl corpus, a collection of over 9 billion web pages.
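
To make "5-gram counts" concrete, here is a minimal sketch of counting 5-grams over a toy corpus. It assumes whitespace tokenization and no sentence-boundary padding; the function name `count_ngrams` and the sample sentences are hypothetical, and this is not the authors' pipeline, which would need deduplication, language identification, and streaming to handle web-scale data.

```python
from collections import Counter


def count_ngrams(lines, n=5):
    """Count n-grams over whitespace-tokenized lines.

    Assumption: no sentence-boundary padding and no text normalization.
    """
    counts = Counter()
    for line in lines:
        tokens = line.split()
        # Slide a window of n tokens across the line.
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


# Hypothetical toy corpus, for illustration only.
sample = [
    "the quick brown fox jumps over the lazy dog",
    "the quick brown fox jumps high",
]
counts = count_ngrams(sample)
```

Counts of this kind are the raw input from which an n-gram language model is estimated; at the scale of billions of pages they cannot be held in an in-memory dictionary as done here.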
Language Modelling • Machine Translation