Search Results for author: Maarten Janssen

Found 8 papers, 0 papers with code

Information Acquisition and Diffusion in Markets

no code implementations30 Sep 2021 Atabek Atayev, Maarten Janssen

Consumers can acquire information through their own search efforts or through their social network.

The COPLE2 corpus: a learner corpus for Portuguese

no code implementations LREC 2016 Am{\'a}lia Mendes, S Antunes, ra, Maarten Janssen, Anabela Gon{\c{c}}alves

We present the COPLE2 corpus, a learner corpus of Portuguese that includes written and spoken texts produced by learners of Portuguese as a second or foreign language.

Lemmatization POS

TEITOK: Text-Faithful Annotated Corpora

no code implementations LREC 2016 Maarten Janssen

TEITOK is a web-based framework for corpus creation, annotation, and distribution, that combines textual and linguistic annotation within a single TEI based XML document.

NeoTag: a POS Tagger for Grammatical Neologism Detection

no code implementations LREC 2012 Maarten Janssen

This tagger, called NeoTag, has an overall accuracy that is comparable to other taggers, but scores much better for grammatical neologisms.

Lemmatization POS +1

The Common Orthographic Vocabulary of the Portuguese Language: a set of open lexical resources for a pluricentric language

no code implementations LREC 2012 Jos{\'e} Pedro Ferreira, Maarten Janssen, Gladis Barcellos de Oliveira, Margarita Correia, Gilvan M{\"u}ller de Oliveira

This paper outlines the design principles and choices, as well as the ongoing development process of the Common Orthographic Vocabulary of the Portuguese Language (VOC), a large scale electronic lexical database which was adopted by the Community of Portuguese-Speaking Countries' (CPLP) Instituto Internacional da L{\'\i}ngua Portuguesa to implement a spelling reform that is currently taking place.

Cannot find the paper you are looking for? You can Submit a new open access paper.