no code implementations • 30 Sep 2021 • Atabek Atayev, Maarten Janssen
Consumers can acquire information through their own search efforts or through their social network.
no code implementations • LREC 2016 • Am{\'a}lia Mendes, S Antunes, ra, Maarten Janssen, Anabela Gon{\c{c}}alves
We present the COPLE2 corpus, a learner corpus of Portuguese that includes written and spoken texts produced by learners of Portuguese as a second or foreign language.
no code implementations • LREC 2016 • Maarten Janssen
TEITOK is a web-based framework for corpus creation, annotation, and distribution, that combines textual and linguistic annotation within a single TEI based XML document.
no code implementations • LREC 2012 • Maarten Janssen
This tagger, called NeoTag, has an overall accuracy that is comparable to other taggers, but scores much better for grammatical neologisms.
no code implementations • LREC 2012 • Jos{\'e} Pedro Ferreira, Maarten Janssen, Gladis Barcellos de Oliveira, Margarita Correia, Gilvan M{\"u}ller de Oliveira
This paper outlines the design principles and choices, as well as the ongoing development process of the Common Orthographic Vocabulary of the Portuguese Language (VOC), a large scale electronic lexical database which was adopted by the Community of Portuguese-Speaking Countries' (CPLP) Instituto Internacional da L{\'\i}ngua Portuguesa to implement a spelling reform that is currently taking place.