Search Results for author: Artem Shelmanov

Found 22 papers, 11 papers with code

Towards the Data-driven System for Rhetorical Parsing of Russian Texts

no code implementations • WS 2019 • Artem Shelmanov, Dina Pisarevskaya, Elena Chistova, Svetlana Toldova, Maria Kobozeva, Ivan Smirnov

Results of the first experimental evaluation of machine learning models trained on Ru-RSTreebank, the first Russian corpus annotated within the RST framework, are presented.

BIG-bench Machine Learning Classification +3

A Dataset for Noun Compositionality Detection for a Slavic Language

1 code implementation • WS 2019 • Dmitry Puzyrev, Artem Shelmanov, Alexander Panchenko, Ekaterina Artemova

This paper presents the first gold-standard resource for Russian annotated with compositionality information of noun compounds.

Semantic Role Labeling with Pretrained Language Models for Known and Unknown Predicates

no code implementations • RANLP 2019 • Daniil Larionov, Artem Shelmanov, Elena Chistova, Ivan Smirnov

We build the first full pipeline for semantic role labelling of Russian texts.

Classification General Classification +1

Word Sense Disambiguation for 158 Languages using Word Embeddings Only

no code implementations • LREC 2020 • Varvara Logacheva, Denis Teslenko, Artem Shelmanov, Steffen Remus, Dmitry Ustalov, Andrey Kutuzov, Ekaterina Artemova, Chris Biemann, Simone Paolo Ponzetto, Alexander Panchenko

We use this method to induce a collection of sense inventories for 158 languages on the basis of the original pre-trained fastText word embeddings by Grave et al. (2018), enabling WSD in these languages.

Word Embeddings Word Sense Disambiguation

Neural Entity Linking: A Survey of Models Based on Deep Learning

no code implementations • 31 May 2020 • Ozge Sevgili, Artem Shelmanov, Mikhail Arkhipov, Alexander Panchenko, Chris Biemann

This survey presents a comprehensive description of recent neural entity linking (EL) systems developed since 2015 as a result of the "deep learning revolution" in natural language processing.

Entity Embeddings Entity Linking

Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

no code implementations • EACL 2021 • Artem Shelmanov, Dmitri Puzyrev, Lyubov Kupriyanova, Denis Belyakov, Daniil Larionov, Nikita Khromov, Olga Kozlova, Ekaterina Artemova, Dmitry V. Dylov, Alexander Panchenko

Annotating training data for sequence tagging of texts is usually very time-consuming.

Active Learning Transfer Learning

How Certain is Your Transformer?

1 code implementation • EACL 2021 • Artem Shelmanov, Evgenii Tsymbalov, Dmitri Puzyrev, Kirill Fedyanin, Alexander Panchenko, Maxim Panov

In this work, we consider the problem of uncertainty estimation for Transformer-based models.

Natural Language Understanding Point Processes

Nonparametric Uncertainty Quantification for Single Deterministic Neural Network

1 code implementation • 7 Feb 2022 • Nikita Kotelevskii, Aleksandr Artemenkov, Kirill Fedyanin, Fedor Noskov, Alexander Fishkov, Artem Shelmanov, Artem Vazhentsev, Aleksandr Petiushko, Maxim Panov

This paper proposes a fast and scalable method for uncertainty quantification of machine learning models' predictions.

text-classification Text Classification +1

Towards Computationally Feasible Deep Active Learning

1 code implementation • Findings (NAACL) 2022 • Akim Tsvigun, Artem Shelmanov, Gleb Kuzmin, Leonid Sanochkin, Daniil Larionov, Gleb Gusev, Manvel Avetisian, Leonid Zhukov

Active learning (AL) is a prominent technique for reducing the annotation effort required for training machine learning models.

Active Learning text-classification +1

NeuralSympCheck: A Symptom Checking and Disease Diagnostic Neural Model with Logic Regularization

1 code implementation • 2 Jun 2022 • Aleksandr Nesterov, Bulat Ibragimov, Dmitriy Umerenkov, Artem Shelmanov, Galina Zubkova, Vladimir Kokh

Symptom checking systems ask users about their symptoms and perform a rapid and affordable medical assessment of their condition.

Reinforcement Learning (RL)

Medical Image Captioning via Generative Pretrained Transformers

no code implementations • 28 Sep 2022 • Alexander Selivanov, Oleg Y. Rogov, Daniil Chesakov, Artem Shelmanov, Irina Fedulova, Dmitry V. Dylov

The proposed model is tested on two medical datasets, Open-I and MIMIC-CXR, and on the general-purpose MS-COCO.

Caption Generation Descriptive +1

Active Learning for Abstractive Text Summarization

1 code implementation • 9 Jan 2023 • Akim Tsvigun, Ivan Lysenko, Danila Sedashov, Ivan Lazichny, Eldar Damirov, Vladimir Karlov, Artemy Belousov, Leonid Sanochkin, Maxim Panov, Alexander Panchenko, Mikhail Burtsev, Artem Shelmanov

Active Learning (AL) is a technique developed to reduce the amount of annotation required to achieve a certain level of machine learning model performance.

Abstractive Text Summarization Active Learning +3

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection

2 code implementations • 24 May 2023 • Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Chenxi Whitehouse, Osama Mohammed Afzal, Tarek Mahmoud, Toru Sasaki, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov

These results show that the problem is far from solved and that there is a lot of room for improvement.

Text Detection

LM-Polygraph: Uncertainty Estimation for Language Models

no code implementations • 13 Nov 2023 • Ekaterina Fadeeva, Roman Vashurin, Akim Tsvigun, Artem Vazhentsev, Sergey Petrakov, Kirill Fedyanin, Daniil Vasilev, Elizaveta Goncharova, Alexander Panchenko, Maxim Panov, Timothy Baldwin, Artem Shelmanov

Recent advancements in the capabilities of large language models (LLMs) have paved the way for a myriad of groundbreaking applications in various fields.

Text Generation

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection

no code implementations • 17 Feb 2024 • Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov

The advent of Large Language Models (LLMs) has brought an unprecedented surge in machine-generated text (MGT) across diverse channels.

Task 2 Text Detection

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification

no code implementations • 7 Mar 2024 • Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov

Uncertainty scores leverage information encapsulated in the output of a neural network or its layers to detect unreliable predictions, and we show that they can be used to fact-check the atomic claims in the LLM output.

Fact Checking Hallucination +1

NB-MLM: Efficient Domain Adaptation of Masked Language Models for Sentiment Analysis

1 code implementation • EMNLP 2021 • Nikolay Arefyev, Dmitrii Kharchev, Artem Shelmanov

While Masked Language Models (MLM) are pre-trained on massive datasets, the additional training with the MLM objective on domain or task-specific data before fine-tuning for the final task is known to improve the final performance.

Domain Adaptation Sentiment Analysis

Medical Crossing: a Cross-lingual Evaluation of Clinical Entity Linking

1 code implementation • LREC 2022 • Anton Alekseev, Zulfat Miftahutdinov, Elena Tutubalina, Artem Shelmanov, Vladimir Ivanov, Vladimir Kokh, Alexander Nesterov, Manvel Avetisian, Andrei Chertok, Sergey Nikolenko

Medical data annotation requires highly qualified expertise.

Entity Linking Transfer Learning +1

Entity Linking over Nested Named Entities for Russian

1 code implementation • LREC 2022 • Natalia Loukachevitch, Pavel Braslavski, Vladimir Ivanov, Tatiana Batura, Suresh Manandhar, Artem Shelmanov, Elena Tutubalina

In this paper, we describe entity linking annotation over nested named entities in the recently released Russian NEREL dataset for information extraction.

Entity Linking

RuCCoN: Clinical Concept Normalization in Russian

no code implementations • Findings (ACL) 2022 • Alexandr Nesterov, Galina Zubkova, Zulfat Miftahutdinov, Vladimir Kokh, Elena Tutubalina, Artem Shelmanov, Anton Alekseev, Manvel Avetisian, Andrey Chertok, Sergey Nikolenko

We present RuCCoN, a new dataset for clinical concept normalization in Russian manually annotated by medical professionals.

Uncertainty Estimation of Transformer Predictions for Misclassification Detection

1 code implementation • ACL 2022 • Artem Vazhentsev, Gleb Kuzmin, Artem Shelmanov, Akim Tsvigun, Evgenii Tsymbalov, Kirill Fedyanin, Maxim Panov, Alexander Panchenko, Gleb Gusev, Mikhail Burtsev, Manvel Avetisian, Leonid Zhukov

Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks such as active learning, misclassification detection, adversarial attack detection, out-of-distribution detection, etc.

Active Learning Adversarial Attack Detection +7

Generating Lexical Representations of Frames using Lexical Substitution

no code implementations • PaM 2020 • Saba Anwar, Artem Shelmanov, Alexander Panchenko, Chris Biemann

We investigate a simple yet effective method, lexical substitution with word representation models, to automatically expand a small set of frame-annotated sentences with new words for their respective roles and LUs.
