2 code implementations • COLING 2020 • Arianna Betti, Martin Reynaert, Thijs Ossenkoppele, Yvette Oortwijn, Andrew Salway, Jelke Bloem
We present a novel, domain expert-controlled, replicable procedure for the construction of concept-modeling ground truths with the aim of evaluating the application of word embeddings.
no code implementations • LREC 2020 • Jelke Bloem, Maria Chiara Parisi, Martin Reynaert, Yvette Oortwijn, Arianna Betti
Our results show that consistent Neo-Latin word embeddings can be learned from this type of data.
no code implementations • LREC 2016 • Martin Reynaert
We present further work on evaluation of the fully automatic post-correction of Early Dutch Books Online, a collection of 10,333 18th-century books.
no code implementations • LREC 2016 • Hennie Brugman, Martin Reynaert, Nicoline van der Sijs, René van Stipriaan, Erik Tjong Kim Sang, Antal Van den Bosch
The Nederlab project aims to bring together all digitized texts relevant to the Dutch national heritage, the history of the Dutch language and culture (circa 800–present) in one user-friendly and tool-enriched open access web interface.
no code implementations • LREC 2014 • Martin Reynaert
We then move to evaluating the new TICCL port on a very large corpus of Dutch books known as EDBO, digitized by the Dutch National Library.
no code implementations • LREC 2012 • Martin Reynaert, Ineke Schuurman, Véronique Hoste, Nelleke Oostdijk, Maarten van Gompel
In this paper we report on the experiences gained in the recent construction of the SoNaR corpus, a 500-million-word (500 MW) reference corpus of contemporary written Dutch.