Search Results for author: Roman Yangarber

Found 31 papers, 0 papers with code

Tools for supporting language learning for Sakha

no code implementations • WS (NoDaLiDa) 2019 • Sardana Ivanova, Anisia Katinskaia, Roman Yangarber

Revita is a freely available online language learning platform for learners beyond the beginner level.

Paper
Add Code

Semi-automatically Annotated Learner Corpus for Russian

no code implementations • LREC 2022 • Anisia Katinskaia, Maria Lebedeva, Jue Hou, Roman Yangarber

We present ReLCo— the Revita Learner Corpus—a new semi-automatically annotated learner corpus for Russian.

Grammatical Error Detection

Paper
Add Code

Projecting named entity recognizers without annotated or parallel corpora

no code implementations • WS (NoDaLiDa) 2019 • Jue Hou, Maximilian Koppatz, José María Hoya Quecedo, Roman Yangarber

Named entity recognition (NER) is a well-researched task in the field of NLP, which typically requires large annotated corpora for training usable models.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Assessing Grammatical Correctness in Language Learning

no code implementations • EACL (BEA) 2021 • Anisia Katinskaia, Roman Yangarber

We approach the problem with the methods for grammatical error detection (GED), since we hypothesize that models for detecting grammatical mistakes can assess the correctness of potential alternative answers in a learning setting.

Grammatical Error Detection LEMMA

Paper
Add Code

Applying Gamification Incentives in the Revita Language-learning System

no code implementations • games (LREC) 2022 • Jue Hou, Ilmari Kylliäinen, Anisia Katinskaia, Giacomo Furlan, Roman Yangarber

Our goal is to keep the learner engaged in long practice sessions over many months—rather than for the short-term.

Paper
Add Code

Slav-NER: the 3rd Cross-lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic Languages

no code implementations • EACL (BSNLP) 2021 • Jakub Piskorski, Bogdan Babych, Zara Kancheva, Olga Kanishcheva, Maria Lebedeva, Michał Marcińczuk, Preslav Nakov, Petya Osenova, Lidia Pivovarova, Senja Pollak, Pavel Přibáň, Ivaylo Radev, Marko Robnik-Sikonja, Vasyl Starko, Josef Steinberger, Roman Yangarber

Seven teams covered all six languages, and five teams participated in the cross-lingual entity linking task.

Cross-Lingual Entity Linking Entity Linking +3

Paper
Add Code

Cross-lingual Named Entity Corpus for Slavic Languages

no code implementations • 30 Mar 2024 • Jakub Piskorski, Michał Marcińczuk, Roman Yangarber

The corpus consists of 5 017 documents on seven topics.

LEMMA Lemmatization

Paper
Add Code

Effects of sub-word segmentation on performance of transformer language models

no code implementations • 9 May 2023 • Jue Hou, Anisia Katinskaia, Anh-Duc Vu, Roman Yangarber

Lastly, we show 4. that LMs of smaller size using morphological segmentation can perform comparably to models of larger size trained with BPE -- both in terms of (1) perplexity and (3) scores on downstream tasks.

Language Modelling Segmentation

Paper
Add Code

Linguistic Constructs as the Representation of the Domain Model in an Intelligent Language Tutoring System

no code implementations • 3 Dec 2022 • Anisia Katinskaia, Jue Hou, Anh-Duc Vu, Roman Yangarber

This paper presents the development of an AI-based language learning platform Revita.

Paper
Add Code

Question Answering and Question Generation for Finnish

no code implementations • 24 Nov 2022 • Ilmari Kylliäinen, Roman Yangarber

We present the first neural QA and QG models that work with Finnish.

Language Modelling Question Answering +2

Paper
Add Code

Neural disambiguation of lemma and part of speech in morphologically rich languages

no code implementations • LREC 2020 • José María Hoya Quecedo, Maximilian W. Koppatz, Giacomo Furlan, Roman Yangarber

We consider the problem of disambiguating the lemma and part of speech of ambiguous words in morphologically rich languages.

LEMMA POS

Paper
Add Code

Toward a Paradigm Shift in Collection of Learner Corpora

no code implementations • LREC 2020 • Anisia Katinskaia, Sardana Ivanova, Roman Yangarber

We present the first version of the longitudinal Revita Learner Corpus (ReLCo), for Russian.

Paper
Add Code

Modeling language learning using specialized Elo rating

no code implementations • WS 2019 • Jue Hou, Koppatz Maximilian, Jos{\'e} Mar{\'\i}a Hoya Quecedo, Nataliya Stoyanova, Roman Yangarber

This application of Elo provides ratings for learners and concepts which correlate well with subjective proficiency levels of the learners and difficulty levels of the concepts.

Paper
Add Code

The Second Cross-Lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic Languages

no code implementations • WS 2019 • Jakub Piskorski, Laska Laskova, Micha{\l} Marci{\'n}czuk, Lidia Pivovarova, Pavel P{\v{r}}ib{\'a}{\v{n}}, Josef Steinberger, Roman Yangarber

The task is recognizing mentions of named entities in Web documents, their normalization, and cross-lingual linking.

Cross-Lingual Entity Linking Entity Linking +3

Paper
Add Code

Comparison of Representations of Named Entities for Document Classification

no code implementations • WS 2018 • Lidia Pivovarova, Roman Yangarber

We explore representations for multi-word names in text classification tasks, on Reuters (RCV1) topic and sector classification.

Document Classification General Classification +4

Paper
Add Code

Benchmarks and models for entity-oriented polarity detection

no code implementations • NAACL 2018 • Lidia Pivovarova, Arto Klami, Roman Yangarber

We address the problem of determining entity-oriented polarity in business news.

Sentiment Analysis Transfer Learning

Paper
Add Code

Revita: a Language-learning Platform at the Intersection of ITS and CALL

no code implementations • LREC 2018 • Anisia Katinskaia, Javad Nouri, Roman Yangarber

Language Acquisition

Paper
Add Code

HCS at SemEval-2017 Task 5: Polarity detection in business news using convolutional neural networks

no code implementations • SEMEVAL 2017 • Lidia Pivovarova, Lloren{\c{c}} Escoter, Arto Klami, Roman Yangarber

Task 5 of SemEval-2017 involves fine-grained sentiment analysis on financial microblogs and news.

Data Augmentation Sentiment Analysis

Paper
Add Code

Revita: a system for language learning and supporting endangered languages

no code implementations • WS 2017 • Anisia Katinskaia, Javad Nouri, Roman Yangarber

Language Acquisition

Paper
Add Code

The First Cross-Lingual Challenge on Recognition, Normalization, and Matching of Named Entities in Slavic Languages

no code implementations • WS 2017 • Jakub Piskorski, Lidia Pivovarova, Jan {\v{S}}najder, Josef Steinberger, Roman Yangarber

The reported evaluation figures reflect the relatively higher level of complexity of named entity-related tasks in the context of processing texts in Slavic languages.

Entity Linking Lemmatization +3