no code implementations • GWC 2016 • Kiril Simov, Alexander Popov, Petya Osenova
In this paper we present an analysis of different semantic relations extracted from WordNet, Extended WordNet and SemCor, with respect to their role in the task of knowledge-based word sense disambiguation.
no code implementations • LREC 2022 • Petya Osenova, Kiril Simov, Iva Marinova, Melania Berbatova
It will be used for: extracting knowledge and making it available through the Bulgaria-centric Knowledge Graph; further developing an annotation scheme that handles multiple domains in SSH; training automatic modules for the most important knowledge-based tasks, such as domain-specific and nested NER, NEL, event detection and profiling.
no code implementations • GWC 2018 • Kiril Simov, Alexander Popov, Iliana Simova, Petya Osenova
In this paper we present an approach for training verb subatom embeddings.
no code implementations • RANLP 2021 • Iva Marinova, Yolina Petrova, Milena Slavcheva, Petya Osenova, Ivaylo Radev, Kiril Simov
The paper describes a system for automatic summarization in English language of online news data that come from different non-English languages.
no code implementations • GWC 2019 • Kiril Simov, Petya Osenova, Laska Laskova, Ivajlo Radev, Zara Kancheva
The paper reports on an ongoing work that manually maps the Bulgarian WordNet BTB-WN with Bulgarian Wikipedia.
2 code implementations • 4 Jun 2023 • Momchil Hardalov, Pepa Atanasova, Todor Mihaylov, Galia Angelova, Kiril Simov, Petya Osenova, Ves Stoyanov, Ivan Koychev, Preslav Nakov, Dragomir Radev
We run the first systematic evaluation of pre-trained language models for Bulgarian, comparing and contrasting results across the nine tasks in the benchmark.
no code implementations • LREC 2020 • Iva Marinova, Laska Laskova, Petya Osenova, Kiril Simov, Alex Popov, er
The paper reports on the usage of deep learning methods for improving a Named Entity Recognition (NER) training corpus and for predicting and annotating new types in a test corpus.
1 code implementation • LREC 2020 • Sina Ahmadi, John Philip McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette Pedersen, Thierry Declerck, Tanja Wissik, Bell, Andrea i, Irene Pisani, Thomas Troelsg{\aa}rd, Sussi Olsen, Simon Krek, Veronika Lipp, Tam{\'a}s V{\'a}radi, L{\'a}szl{\'o} Simon, Andr{\'a}s Gyorffy, Carole Tiberius, Tanneke Schoonheim, Yifat Ben Moshe, Maya Rudich, Raya Abu Ahmad, Dorielle Lonke, Kira Kovalenko, Margit Langemets, Jelena Kallas, Oksana Dereza, Theodorus Fransen, David Cillessen, David Lindemann, Mikel Alonso, Ana Salgado, Jos{\'e} Luis Sancho, Rafael-J. Ure{\~n}a-Ruiz, Jordi Porta Zamorano, Kiril Simov, Petya Osenova, Zara Kancheva, Ivaylo Radev, Ranka Stankovi{\'c}, Andrej Perdih, Dejan Gabrovsek
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography.
no code implementations • EACL 2012 • Georgi Georgiev, Valentin Zhikov, Petya Osenova, Kiril Simov, Preslav Nakov
We present experiments with part-of-speech tagging for Bulgarian, a Slavic language with rich inflectional and derivational morphology.
no code implementations • RANLP 2019 • Alex Popov, er, Kiril Simov, Petya Osenova
This paper introduces several improvements over the current state of the art in knowledge-based word sense disambiguation.
no code implementations • RANLP 2019 • Lilia Simeonova, Kiril Simov, Petya Osenova, Preslav Nakov
We propose a morphologically informed model for named entity recognition, which is based on LSTM-CRF architecture and combines word embeddings, Bi-LSTM character embeddings, part-of-speech (POS) tags, and morphological information.
no code implementations • WS 2019 • Laska Laskova, Petya Osenova, Kiril Simov, Ivajlo Radev, Zara Kancheva
The paper presents the characteristics of the predominant types of MultiWord expressions (MWEs) in the BulTreeBank WordNet {--} BTB-WN.
no code implementations • RANLP 2017 • Kiril Simov, Svetla Boytcheva, Petya Osenova
Word vectors with varying dimensionalities and produced by different algorithms have been extensively used in NLP.
no code implementations • RANLP 2017 • Petya Osenova, Kiril Simov
The paper presents a deep factored machine translation (MT) system between English and Bulgarian languages in both directions.
no code implementations • RANLP 2017 • Ivajlo Radev, Kiril Simov, Galia Angelova, Svetla Boytcheva
In this paper we describe annotation process of clinical texts with morphosyntactic and semantic information.
no code implementations • WS 2016 • Rosa Gaudio, Gorka Labaka, Eneko Agirre, Petya Osenova, Kiril Simov, Martin Popel, Dieke Oele, Gertjan van Noord, Lu{\'\i}s Gomes, Jo{\~a}o Ant{\'o}nio Rodrigues, Steven Neale, Jo{\~a}o Silva, Andreia Querido, Nuno Rendeiro, Ant{\'o}nio Branco
no code implementations • LREC 2016 • Arantxa Otegi, Nora Aranberri, Antonio Branco, Jan Haji{\v{c}}, Martin Popel, Kiril Simov, Eneko Agirre, Petya Osenova, Rita Pereira, Jo{\~a}o Silva, Steven Neale
This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part-of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference.
no code implementations • LREC 2014 • Kiril Simov, Iliana Simova, Ginka Ivanova, Maria Mateva, Petya Osenova
In this paper we present a system for experimenting with combinations of dependency parsers.
no code implementations • LREC 2014 • Masood Ghayoomi, Kiril Simov, Petya Osenova
In this paper, we report the obtained results of two constituency parsers trained with BulTreeBank, an HPSG-based treebank for Bulgarian.
no code implementations • LREC 2012 • Aleks Savkov, ar, Laska Laskova, Stanislava Kancheva, Petya Osenova, Kiril Simov
This paper presents a linguistic processing pipeline for Bulgarian including morphological analysis, lemmatization and syntactic analysis of Bulgarian texts.
no code implementations • LREC 2012 • Petya Osenova, Kiril Simov, Laska Laskova, Stanislava Kancheva
The paper presents a treebank-driven approach to the construction of a Bulgarian valence lexicon with ontological restrictions over the inner participants of the event.
no code implementations • LREC 2012 • Petya Osenova, Kiril Simov
The paper introduces the Political Speech Corpus of Bulgarian.