1 code implementation • SemEval (NAACL) 2022 • Amina Miftahova, Alexander Pugachev, Artem Skiba, Katya Artemova, Tatiana Batura, Pavel Braslavski, Vladimir Ivanov
The first approach follows the token classification schema, in which each token is assigned with a tag.
1 code implementation • LREC 2022 • Natalia Loukachevitch, Pavel Braslavski, Vladimir Ivanov, Tatiana Batura, Suresh Manandhar, Artem Shelmanov, Elena Tutubalina
In this paper, we describe entity linking annotation over nested named entities in the recently released Russian NEREL dataset for information extraction.
1 code implementation • 6 Oct 2023 • Anna Marshalova, Elena Bruches, Tatiana Batura
Being able to extract from scientific papers their main points, key insights, and other important information, referred to here as aspects, might facilitate the process of conducting a scientific literature review.
no code implementations • sdp (COLING) 2022 • Sergey Berezin, Tatiana Batura
After that, this model is used to mask named entities in the text and the BART model is trained to reconstruct them.
1 code implementation • 21 Oct 2022 • Natalia Loukachevitch, Suresh Manandhar, Elina Baral, Igor Rozhkov, Pavel Braslavski, Vladimir Ivanov, Tatiana Batura, Elena Tutubalina
NEREL-BIO provides annotation for nested named entities as an extension of the scheme employed for NEREL.
no code implementations • COLING 2022 • Elena Bruches, Olga Tikhobaeva, Yana Dementyeva, Tatiana Batura
This paper is devoted to the extraction of entities and semantic relations between them from scientific texts, where we consider scientific terms as entities.
1 code implementation • 23 May 2022 • Ekaterina Artemova, Maxim Zmeev, Natalia Loukachevitch, Igor Rozhkov, Tatiana Batura, Vladimir Ivanov, Elena Tutubalina
In the test set the frequency of all entity types is even.
no code implementations • 14 Sep 2021 • Elena Bruches, Anastasia Mezentseva, Tatiana Batura
In this paper, we present a system for information extraction from scientific texts in the Russian language.
1 code implementation • RANLP 2021 • Natalia Loukachevitch, Ekaterina Artemova, Tatiana Batura, Pavel Braslavski, Ilia Denisov, Vladimir Ivanov, Suresh Manandhar, Alexander Pugachev, Elena Tutubalina
In this paper, we present NEREL, a Russian dataset for named entity recognition and relation extraction.
no code implementations • 19 Nov 2020 • Elena Bruches, Alexey Pauls, Tatiana Batura, Vladimir Isachenko
This paper is devoted to the study of methods for information extraction (entity recognition and relation classification) from scientific texts on information technology.
no code implementations • 29 Oct 2020 • Vitaly Ivanin, Ekaterina Artemova, Tatiana Batura, Vladimir Ivanov, Veronika Sarkisyan, Elena Tutubalina, Ivan Smurov
We show-case an application of information extraction methods, such as named entity recognition (NER) and relation extraction (RE) to a novel corpus, consisting of documents, issued by a state agency.
1 code implementation • 1 Jul 2020 • Ekaterina Artemova, Tatiana Batura, Anna Golenkovskaya, Vitaly Ivanin, Vladimir Ivanov, Veronika Sarkisyan, Ivan Smurov, Elena Tutubalina
In this paper we present a corpus of Russian strategic planning documents, RuREBus.