no code implementations • GWC 2019 • Kiril Simov, Petya Osenova, Laska Laskova, Ivajlo Radev, Zara Kancheva
The paper reports on ongoing work to manually map the Bulgarian WordNet BTB-WN to Bulgarian Wikipedia.
no code implementations • CLIB 2020 • Petya Osenova
Second, the semantic constraints considered here are limited to a set of semantic roles and build on the lexicographic classes of verbs in WordNet.
no code implementations • RANLP 2021 • Iva Marinova, Yolina Petrova, Milena Slavcheva, Petya Osenova, Ivaylo Radev, Kiril Simov
The paper describes a system for automatic summarization, in English, of online news data that come from different non-English languages.
no code implementations • GWC 2018 • Kiril Simov, Alexander Popov, Iliana Simova, Petya Osenova
In this paper we present an approach for training verb subatom embeddings.
no code implementations • LREC 2022 • Petya Osenova, Kiril Simov, Iva Marinova, Melania Berbatova
It will be used for: extracting knowledge and making it available through the Bulgaria-centric Knowledge Graph; further developing an annotation scheme that handles multiple domains in SSH; training automatic modules for the most important knowledge-based tasks, such as domain-specific and nested NER, NEL, event detection and profiling.
no code implementations • GWC 2016 • Kiril Simov, Alexander Popov, Petya Osenova
In this paper we present an analysis of different semantic relations extracted from WordNet, Extended WordNet and SemCor, with respect to their role in the task of knowledge-based word sense disambiguation.
no code implementations • CLIB 2022 • Petya Osenova
The paper discusses the raising and control syntactic structures (marked as ‘xcomp’) in a UD parsed corpus of Bulgarian Parliamentary Sessions.
no code implementations • ParlaCLARIN (LREC) 2022 • Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, Katja Meden
In ParlaMint I, a CLARIN-ERIC supported project in pandemic times, a set of comparable and uniformly annotated multilingual corpora for 17 national parliaments was developed and released in 2021.
no code implementations • EACL (BSNLP) 2021 • Jakub Piskorski, Bogdan Babych, Zara Kancheva, Olga Kanishcheva, Maria Lebedeva, Michał Marcińczuk, Preslav Nakov, Petya Osenova, Lidia Pivovarova, Senja Pollak, Pavel Přibáň, Ivaylo Radev, Marko Robnik-Sikonja, Vasyl Starko, Josef Steinberger, Roman Yangarber
Seven teams covered all six languages, and five teams participated in the cross-lingual entity linking task.
2 code implementations • 4 Jun 2023 • Momchil Hardalov, Pepa Atanasova, Todor Mihaylov, Galia Angelova, Kiril Simov, Petya Osenova, Ves Stoyanov, Ivan Koychev, Preslav Nakov, Dragomir Radev
We run the first systematic evaluation of pre-trained language models for Bulgarian, comparing and contrasting results across the nine tasks in the benchmark.
no code implementations • 3 Jul 2022 • Kristian Miok, Encarnacion Hidalgo-Tenorio, Petya Osenova, Miguel-Angel Benitez-Castro, Marko Robnik-Sikonja
Parliamentary and legislative debate transcripts provide informative insight into elected politicians' opinions, positions, and policy preferences.
no code implementations • 26 Sep 2021 • Georgi Georgiev, Preslav Nakov, Kuzman Ganchev, Petya Osenova, Kiril Ivanov Simov
The paper presents a feature-rich approach to the automatic recognition and categorization of named entities (persons, organizations, locations, and miscellaneous) in news text for Bulgarian.
no code implementations • LREC 2020 • Iva Marinova, Laska Laskova, Petya Osenova, Kiril Simov, Alexander Popov
The paper reports on the usage of deep learning methods for improving a Named Entity Recognition (NER) training corpus and for predicting and annotating new types in a test corpus.
1 code implementation • LREC 2020 • Sina Ahmadi, John Philip McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, Thomas Troelsgård, Sussi Olsen, Simon Krek, Veronika Lipp, Tamás Váradi, László Simon, András Gyorffy, Carole Tiberius, Tanneke Schoonheim, Yifat Ben Moshe, Maya Rudich, Raya Abu Ahmad, Dorielle Lonke, Kira Kovalenko, Margit Langemets, Jelena Kallas, Oksana Dereza, Theodorus Fransen, David Cillessen, David Lindemann, Mikel Alonso, Ana Salgado, José Luis Sancho, Rafael-J. Ureña-Ruiz, Jordi Porta Zamorano, Kiril Simov, Petya Osenova, Zara Kancheva, Ivaylo Radev, Ranka Stanković, Andrej Perdih, Dejan Gabrovsek
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography.
no code implementations • EACL 2012 • Georgi Georgiev, Valentin Zhikov, Petya Osenova, Kiril Simov, Preslav Nakov
We present experiments with part-of-speech tagging for Bulgarian, a Slavic language with rich inflectional and derivational morphology.
no code implementations • RANLP 2019 • Alexander Popov, Kiril Simov, Petya Osenova
This paper introduces several improvements over the current state of the art in knowledge-based word sense disambiguation.
no code implementations • RANLP 2019 • Lilia Simeonova, Kiril Simov, Petya Osenova, Preslav Nakov
We propose a morphologically informed model for named entity recognition, which is based on LSTM-CRF architecture and combines word embeddings, Bi-LSTM character embeddings, part-of-speech (POS) tags, and morphological information.
no code implementations • WS 2019 • Laska Laskova, Petya Osenova, Kiril Simov, Ivajlo Radev, Zara Kancheva
The paper presents the characteristics of the predominant types of MultiWord expressions (MWEs) in the BulTreeBank WordNet (BTB-WN).
no code implementations • WS 2017 • Kalliopi Zervanou, Petya Osenova, Eveline Wandl-Vogt (Austrian Academy of Sciences, Austria), Dan Cristea ("Alexandru Ioan Cuza" University of Iasi, Romania)
no code implementations • RANLP 2017 • Kiril Simov, Svetla Boytcheva, Petya Osenova
Word vectors with varying dimensionalities and produced by different algorithms have been extensively used in NLP.
no code implementations • RANLP 2017 • Petya Osenova, Kiril Simov
The paper presents a deep factored machine translation (MT) system between English and Bulgarian in both directions.
no code implementations • WS 2016 • Rosa Gaudio, Gorka Labaka, Eneko Agirre, Petya Osenova, Kiril Simov, Martin Popel, Dieke Oele, Gertjan van Noord, Luís Gomes, João António Rodrigues, Steven Neale, João Silva, Andreia Querido, Nuno Rendeiro, António Branco
no code implementations • LREC 2016 • Arantxa Otegi, Nora Aranberri, Antonio Branco, Jan Hajič, Martin Popel, Kiril Simov, Eneko Agirre, Petya Osenova, Rita Pereira, João Silva, Steven Neale
This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part-of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference.
no code implementations • LREC 2016 • Victoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejček, Agata Savary, Petya Osenova
The comparison is focused on the annotation of light verb constructions and verbal idioms.
no code implementations • LREC 2016 • John Philip McCrae, Christian Chiarcos, Francis Bond, Philipp Cimiano, Thierry Declerck, Gerard de Melo, Jorge Gracia, Sebastian Hellmann, Bettina Klimek, Steven Moran, Petya Osenova, Antonio Pareja-Lora, Jonathan Pool
The Open Linguistics Working Group (OWLG) brings together researchers from various fields of linguistics, natural language processing, and information technology to present and discuss principles, case studies, and best practices for representing, publishing and linking linguistic data collections.
no code implementations • LREC 2014 • Masood Ghayoomi, Kiril Simov, Petya Osenova
In this paper, we report the results obtained with two constituency parsers trained on BulTreeBank, an HPSG-based treebank for Bulgarian.
no code implementations • LREC 2014 • Kiril Simov, Iliana Simova, Ginka Ivanova, Maria Mateva, Petya Osenova
In this paper we present a system for experimenting with combinations of dependency parsers.
no code implementations • LREC 2012 • Petya Osenova, Kiril Simov
The paper introduces the Political Speech Corpus of Bulgarian.
no code implementations • LREC 2012 • Aleksandar Savkov, Laska Laskova, Stanislava Kancheva, Petya Osenova, Kiril Simov
This paper presents a linguistic processing pipeline for Bulgarian including morphological analysis, lemmatization and syntactic analysis of Bulgarian texts.
no code implementations • LREC 2012 • Petya Osenova, Kiril Simov, Laska Laskova, Stanislava Kancheva
The paper presents a treebank-driven approach to the construction of a Bulgarian valence lexicon with ontological restrictions over the inner participants of the event.