Lemmatization

24 papers with code · Natural Language Processing

Leaderboards

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Latest papers without code

Neural Polysynthetic Language Modelling

11 May 2020

In the literature, languages like Finnish or Turkish are held up as extreme examples of complexity that challenge common modelling assumptions.

LANGUAGE MODELLING LEMMATIZATION MACHINE TRANSLATION

A Resource for Studying Chatino Verbal Morphology

5 Apr 2020

We present the first resource focusing on the verbal inflectional morphology of San Juan Quiahije Chatino, a tonal Mesoamerican language spoken in Mexico.

LEMMATIZATION MORPHOLOGICAL ANALYSIS MORPHOLOGICAL INFLECTION

ÚFAL MRPipe at MRP 2019: UDPipe Goes Semantic in the Meaning Representation Parsing Shared Task

CONLL 2019

We present a system description of our contribution to the CoNLL 2019 shared task, Cross-Framework Meaning Representation Parsing (MRP 2019).

DEPENDENCY PARSING LEMMATIZATION SEMANTIC PARSING

The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection

WS 2019

The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages.

CROSS-LINGUAL TRANSFER LEMMATIZATION MORPHOLOGICAL ANALYSIS MORPHOLOGICAL INFLECTION TRANSFER LEARNING

Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging

5 Oct 2019

Semitic languages can be highly ambiguous, having several interpretations of the same surface forms, and morphologically rich, having many morphemes that realize several morphological features.

LEMMATIZATION MORPHOLOGICAL TAGGING

Czech Text Processing with Contextual Embeddings: POS Tagging, Lemmatization, Parsing and NER

8 Sep 2019

We evaluate two methods for precomputing such embeddings, BERT and Flair, on four Czech text processing tasks: part-of-speech (POS) tagging, lemmatization, dependency parsing and named entity recognition (NER).

DEPENDENCY PARSING LEMMATIZATION NAMED ENTITY RECOGNITION PART-OF-SPEECH TAGGING

Corpora and Processing Tools for Non-standard Contemporary and Diachronic Balkan Slavic

RANLP 2019

The paper describes three corpora of different varieties of Balkan Slavic (BS) that are currently being developed with the goal of providing data for the analysis of the diatopic and diachronic variation in non-standard BS.

LEMMATIZATION