Search Results for author: Grigori Sidorov

Found 37 papers, 3 papers with code

MUCIC@TamilNLP-ACL2022: Abusive Comment Detection in Tamil Language using 1D Conv-LSTM

1 code implementation DravidianLangTech (ACL) 2022 Fazlourrahman Balouchzahi, Anusha Gowda, Hosahalli Shashirekha, Grigori Sidorov

To address the automatic detection of abusive languages in online platforms, this paper describes the models submitted by our team - MUCIC to the shared task on “Abusive Comment Detection in Tamil-ACL 2022”.

Abusive Language

The IPN-CIC team system submission for the WMT 2020 similar language task

no code implementations WMT (EMNLP) 2020 Luis A. Menéndez-Salazar, Grigori Sidorov, Marta R. Costa-jussà

This paper describes the participation of the NLP research team of the IPN Computer Research center in the WMT 2020 Similar Language Translation Task.

Domain Adaptation Translation

MUCIC at ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Using N-grams and Multilingual Sentence Encoders

no code implementations ICON 2021 Fazlourrahman Balouchzahi, Oxana Vitman, Hosahalli Lakshmaiah Shashirekha, Grigori Sidorov, Alexander Gelbukh

These approaches obtained the highest performance in the shared task for Meitei, Bangla, and Multilingual texts with instance-F1 scores of 0. 350, 0. 412, and 0. 380 respectively using Pre-aggregation of labels.

Blocking Language Identification +4

GuReT: Distinguishing Guilt and Regret related Text

no code implementations29 Jan 2024 Sabur Butt, Fazlourrahman Balouchzahi, Abdul Gafar Manuel Meque, Maaz Amjad, Hector G. Ceballos Cancino, Grigori Sidorov, Alexander Gelbukh

The intricate relationship between human decision-making and emotions, particularly guilt and regret, has significant implications on behavior and well-being.

Binary Classification Decision Making

Leveraging the power of transformers for guilt detection in text

no code implementations15 Jan 2024 Abdul Gafar Manuel Meque, Jason Angel, Grigori Sidorov, Alexander Gelbukh

In recent years, language models and deep learning techniques have revolutionized natural language processing tasks, including emotion detection.

SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish

no code implementations7 Nov 2023 Yevhen Kostiuk, Grigori Sidorov, Olga Kolesnikova

In this paper, we present a dataset of most frequent Spanish verb-noun collocations and sentences where they occur, each collocation is assigned to one of 37 lexical functions defined as classes for a hierarchical classification task.

Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts

no code implementations2 Jun 2023 Yevhen Kostiuk, Atnafu Lambebo Tonja, Grigori Sidorov, Olga Kolesnikova

In this paper, we investigate the issue of hate speech by presenting a novel task of translating hate speech into non-hate speech text while preserving its meaning.

Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models

no code implementations27 May 2023 Atnafu Lambebo Tonja, Hellina Hailu Nigatu, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh, Jugal Kalita

This paper describes CIC NLP's submission to the AmericasNLP 2023 Shared Task on machine translation systems for indigenous languages of the Americas.

Machine Translation Transfer Learning +1

Guilt Detection in Text: A Step Towards Understanding Complex Emotions

no code implementations6 Mar 2023 Abdul Gafar Manuel Meque, Nisar Hussain, Grigori Sidorov, Alexander Gelbukh

We introduce a novel Natural Language Processing (NLP) task called Guilt detection, which focuses on detecting guilt in text.

Sarcasm Detection Framework Using Context, Emotion and Sentiment Features

no code implementations23 Nov 2022 Oxana Vitman, Yevhen Kostiuk, Grigori Sidorov, Alexander Gelbukh

We use a pre-trained transformer and CNN to capture context features, and we use transformers pre-trained on emotions detection and sentiment analysis tasks.

Sarcasm Detection Sentiment Analysis

The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation

2 code implementations27 Oct 2022 Tadesse Destaw Belay, Atnafu Lambebo Tonja, Olga Kolesnikova, Seid Muhie Yimam, Abinew Ali Ayele, Silesh Bogale Haile, Grigori Sidorov, Alexander Gelbukh

Machine translation (MT) is one of the main tasks in natural language processing whose objective is to translate texts automatically from one natural language to another.

Machine Translation Sentence +1

PolyHope: Two-Level Hope Speech Detection from Tweets

no code implementations25 Oct 2022 Fazlourrahman Balouchzahi, Grigori Sidorov, Alexander Gelbukh

This strict annotation process resulted in promising performance for simple machine learning classifiers with only bi-grams; however, binary and multiclass hope speech detection results reveal that contextual embedding models have higher performance in this dataset.

Hope Speech Detection Vocal Bursts Valence Prediction

Mapping Process for the Task: Wikidata Statements to Text as Wikipedia Sentences

no code implementations23 Oct 2022 Hoang Thang Ta, Alexander Gelbukha, Grigori Sidorov

Acknowledged as one of the most successful online cooperative projects in human society, Wikipedia has obtained rapid growth in recent years and desires continuously to expand content and disseminate knowledge values for everyone globally.

Data-to-Text Generation Sentence

Overview of Abusive and Threatening Language Detection in Urdu at FIRE 2021

1 code implementation14 Jul 2022 Maaz Amjad, Alisa Zhila, Grigori Sidorov, Andrey Labunets, Sabur Butta, Hamza Imam Amjad, Oxana Vitman, Alexander Gelbukh

In this paper, we present two shared tasks of abusive and threatening language detection for the Urdu language which has more than 170 million speakers worldwide.

Abusive Language Binary Classification

UrduFake@FIRE2021: Shared Track on Fake News Identification in Urdu

no code implementations11 Jul 2022 Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh

This study reports the second shared task named as UrduFake@FIRE2021 on identifying fake news detection in Urdu language.

Binary Classification Fake News Detection

Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2021

no code implementations11 Jul 2022 Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Alisa Zhila, Grigori Sidorov, Alexander Gelbukh

Admittedly, while training sets from the past and the current years overlap to a large extent, the testing set provided this year is completely different.

Binary Classification Fake News Detection

What goes on inside rumour and non-rumour tweets and their reactions: A Psycholinguistic Analyses

no code implementations9 Nov 2021 Sabur Butt, Shakshi Sharma, Rajesh Sharma, Grigori Sidorov, Alexander Gelbukh

In the descriptive line of works, where researchers have tried to analyse rumours using NLP approaches, there isnt much emphasis on psycho-linguistics analyses of social media text.

Descriptive Misinformation +1

Data Augmentation using Machine Translation for Fake News Detection in the Urdu Language

no code implementations LREC 2020 Maaz Amjad, Grigori Sidorov, Alisa Zhila

As the fake news phenomenon is omnipresent across all languages, it is crucial to be able to efficiently solve this problem for languages other than English.

Data Augmentation Fake News Detection +2

The Role of Emotions in Native Language Identification

no code implementations WS 2018 Ilia Markov, Vivi Nastase, Carlo Strapparava, Grigori Sidorov

We explore the hypothesis that emotion is one of the dimensions of language that surfaces from the native language into a second language.

Deception Detection Native Language Identification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.