no code implementations • WS (NoDaLiDa) 2019 • Sardana Ivanova, Anisia Katinskaia, Roman Yangarber
Revita is a freely available online language learning platform for learners beyond the beginner level.
no code implementations • LREC 2022 • Anisia Katinskaia, Maria Lebedeva, Jue Hou, Roman Yangarber
We present ReLCo— the Revita Learner Corpus—a new semi-automatically annotated learner corpus for Russian.
no code implementations • WS (NoDaLiDa) 2019 • Jue Hou, Maximilian Koppatz, José María Hoya Quecedo, Roman Yangarber
Named entity recognition (NER) is a well-researched task in the field of NLP, which typically requires large annotated corpora for training usable models.
no code implementations • EACL (BEA) 2021 • Anisia Katinskaia, Roman Yangarber
We approach the problem with the methods for grammatical error detection (GED), since we hypothesize that models for detecting grammatical mistakes can assess the correctness of potential alternative answers in a learning setting.
no code implementations • games (LREC) 2022 • Jue Hou, Ilmari Kylliäinen, Anisia Katinskaia, Giacomo Furlan, Roman Yangarber
Our goal is to keep the learner engaged in long practice sessions over many months—rather than for the short-term.
no code implementations • EACL (BSNLP) 2021 • Jakub Piskorski, Bogdan Babych, Zara Kancheva, Olga Kanishcheva, Maria Lebedeva, Michał Marcińczuk, Preslav Nakov, Petya Osenova, Lidia Pivovarova, Senja Pollak, Pavel Přibáň, Ivaylo Radev, Marko Robnik-Sikonja, Vasyl Starko, Josef Steinberger, Roman Yangarber
Seven teams covered all six languages, and five teams participated in the cross-lingual entity linking task.
no code implementations • 30 Mar 2024 • Jakub Piskorski, Michał Marcińczuk, Roman Yangarber
The corpus consists of 5 017 documents on seven topics.
no code implementations • 9 May 2023 • Jue Hou, Anisia Katinskaia, Anh-Duc Vu, Roman Yangarber
Lastly, we show 4. that LMs of smaller size using morphological segmentation can perform comparably to models of larger size trained with BPE -- both in terms of (1) perplexity and (3) scores on downstream tasks.
no code implementations • 3 Dec 2022 • Anisia Katinskaia, Jue Hou, Anh-Duc Vu, Roman Yangarber
This paper presents the development of an AI-based language learning platform Revita.
no code implementations • 24 Nov 2022 • Ilmari Kylliäinen, Roman Yangarber
We present the first neural QA and QG models that work with Finnish.
no code implementations • LREC 2020 • José María Hoya Quecedo, Maximilian W. Koppatz, Giacomo Furlan, Roman Yangarber
We consider the problem of disambiguating the lemma and part of speech of ambiguous words in morphologically rich languages.
no code implementations • LREC 2020 • Anisia Katinskaia, Sardana Ivanova, Roman Yangarber
We present the first version of the longitudinal Revita Learner Corpus (ReLCo), for Russian.
no code implementations • WS 2019 • Jue Hou, Koppatz Maximilian, Jos{\'e} Mar{\'\i}a Hoya Quecedo, Nataliya Stoyanova, Roman Yangarber
This application of Elo provides ratings for learners and concepts which correlate well with subjective proficiency levels of the learners and difficulty levels of the concepts.
no code implementations • WS 2019 • Jakub Piskorski, Laska Laskova, Micha{\l} Marci{\'n}czuk, Lidia Pivovarova, Pavel P{\v{r}}ib{\'a}{\v{n}}, Josef Steinberger, Roman Yangarber
The task is recognizing mentions of named entities in Web documents, their normalization, and cross-lingual linking.
no code implementations • WS 2018 • Lidia Pivovarova, Roman Yangarber
We explore representations for multi-word names in text classification tasks, on Reuters (RCV1) topic and sector classification.
no code implementations • NAACL 2018 • Lidia Pivovarova, Arto Klami, Roman Yangarber
We address the problem of determining entity-oriented polarity in business news.
no code implementations • SEMEVAL 2017 • Lidia Pivovarova, Lloren{\c{c}} Escoter, Arto Klami, Roman Yangarber
Task 5 of SemEval-2017 involves fine-grained sentiment analysis on financial microblogs and news.
no code implementations • WS 2017 • Jakub Piskorski, Lidia Pivovarova, Jan {\v{S}}najder, Josef Steinberger, Roman Yangarber
The reported evaluation figures reflect the relatively higher level of complexity of named entity-related tasks in the context of processing texts in Slavic languages.
no code implementations • EACL 2017 • Lloren{\c{c}} Escoter, Lidia Pivovarova, Mian Du, Anisia Katinskaia, Roman Yangarber
In news aggregation systems focused on broad news domains, certain stories may appear in multiple articles.
no code implementations • LREC 2016 • Javad Nouri, Roman Yangarber
Unsupervised learning of morphological segmentation of words in a language, based only on a large corpus of words, is a challenging task.