Search Results for author: Grigori Sidorov

Found 54 papers, 5 papers with code

MUCIC at ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Using N-grams and Multilingual Sentence Encoders

no code implementations ICON 2021 Fazlourrahman Balouchzahi, Oxana Vitman, Hosahalli Lakshmaiah Shashirekha, Grigori Sidorov, Alexander Gelbukh

These approaches obtained the highest performance in the shared task for Meitei, Bangla, and Multilingual texts with instance-F1 scores of 0. 350, 0. 412, and 0. 380 respectively using Pre-aggregation of labels.

Blocking Language Identification +4

MUCIC@TamilNLP-ACL2022: Abusive Comment Detection in Tamil Language using 1D Conv-LSTM

1 code implementation DravidianLangTech (ACL) 2022 Fazlourrahman Balouchzahi, Anusha Gowda, Hosahalli Shashirekha, Grigori Sidorov

To address the automatic detection of abusive languages in online platforms, this paper describes the models submitted by our team - MUCIC to the shared task on “Abusive Comment Detection in Tamil-ACL 2022”.

Abusive Language

The IPN-CIC team system submission for the WMT 2020 similar language task

no code implementations WMT (EMNLP) 2020 Luis A. Menéndez-Salazar, Grigori Sidorov, Marta R. Costa-jussà

This paper describes the participation of the NLP research team of the IPN Computer Research center in the WMT 2020 Similar Language Translation Task.

Domain Adaptation Translation

GS_DravidianLangTech@2025: Women Targeted Abusive Texts Detection on Social Media

no code implementations1 Apr 2025 Girma Yohannis Bade, Zahra Ahani, Olga Kolesnikova, José Luis Oropeza, Grigori Sidorov

The increasing misuse of social media has become a concern; however, technological solutions are being developed to moderate its content effectively.

Abusive Language regression

Opioid Named Entity Recognition (ONER-2025) from Reddit

no code implementations28 Mar 2025 Grigori Sidorov, Muhammad Ahmad, Iqra Ameer, Muhammad Usman, Ildar Batyrshin

Social media platforms like Reddit provide vast amounts of unstructured data that offer insights into public perceptions, discussions, and experiences related to opioid use.

named-entity-recognition Named Entity Recognition +1

Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages

no code implementations24 Mar 2025 Tadesse Destaw Belay, Dawit Ketema Gete, Abinew Ali Ayele, Olga Kolesnikova, Grigori Sidorov, Seid Muhie Yimam

As users express different emotions simultaneously in a single instance, annotating emotions in a multilabel setting such as the EthioEmo (Belay et al., 2025) dataset effectively captures this dynamic.

Benchmarking Decision Making +3

Automated diagnosis of lung diseases using vision transformer: a comparative study on chest x-ray classification

no code implementations22 Mar 2025 Muhammad Ahmad, Sardar Usman, Ildar Batyrshin, Muhammad Muzammil, K. Sajid, M. Hasnain, Muhammad Jalal, Grigori Sidorov

Early prediction of these conditions is critical, as it allows for the identification of risk factors and implementation of preventive measures to reduce the likelihood of disease onset Methods: In this study, we utilized a dataset comprising 3, 475 chest X-ray images sourced from from Mendeley Data provided by Talukder, M. A.

Binary Classification Diagnostic +3

PRKAN: Parameter-Reduced Kolmogorov-Arnold Networks

1 code implementation13 Jan 2025 Hoang-Thang Ta, Duy-Quy Thai, Anh Tran, Grigori Sidorov, Alexander Gelbukh

Kolmogorov-Arnold Networks (KANs) represent an innovation in neural network architectures, offering a compelling alternative to Multi-Layer Perceptrons (MLPs) in models such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Transformers.

Dimensionality Reduction Image Classification +1

Advanced Machine Learning Techniques for Social Support Detection on Social Media

no code implementations6 Jan 2025 Olga Kolesnikova, Moein Shahiki Tash, Zahra Ahani, Ameeta Agrawal, Raul Monroy, Grigori Sidorov

Additionally, we achieved a 0. 4\% increase in the macro F1 score for the second task and a 0. 7\% increase for the third task, compared to previous work utilizing traditional machine learning with psycholinguistic and unigram-based TF-IDF values.

Zero-Shot Learning

Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding

no code implementations17 Dec 2024 Tadesse Destaw Belay, Israel Abebe Azime, Abinew Ali Ayele, Grigori Sidorov, Dietrich Klakow, Philipp Slusallek, Olga Kolesnikova, Seid Muhie Yimam

The results show that accurate multi-label emotion classification is still insufficient even for high-resource languages such as English, and there is a large gap between the performance of high-resource and low-resource languages.

Decoder Emotion Classification

Social Support Detection from Social Media Texts

no code implementations4 Nov 2024 Zahra Ahani, Moein Shahiki Tash, Fazlourrahman Balouchzahi, Luis Ramos, Grigori Sidorov, Alexander Gelbukh

Social support, conveyed through a multitude of interactions and platforms such as social media, plays a pivotal role in fostering a sense of belonging, aiding resilience in the face of challenges, and enhancing overall well-being.

Binary Classification Word Embeddings

Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models

no code implementations4 Sep 2024 Moein Shahiki Tash, Zahra Ahani, Mohim Tash, Olga Kolesnikova, Grigori Sidorov

This study performs analysis of Predictive statements, Hope speech, and Regret Detection behaviors within cryptocurrency-related discussions, leveraging advanced natural language processing techniques.

Decision Making Few-Shot Learning +3

FC-KAN: Function Combinations in Kolmogorov-Arnold Networks

2 code implementations3 Sep 2024 Hoang-Thang Ta, Duy-Quy Thai, Abu Bakar Siddiqur Rahman, Grigori Sidorov, Alexander Gelbukh

In this paper, we introduce FC-KAN, a Kolmogorov-Arnold Network (KAN) that leverages combinations of popular mathematical functions such as B-splines, wavelets, and radial basis functions on low-dimensional data through element-wise operations.

Image Classification Kolmogorov-Arnold Networks

A multitask learning framework for leveraging subjectivity of annotators to identify misogyny

no code implementations22 Jun 2024 Jason Angel, Segun Taofeek Aroyehun, Grigori Sidorov, Alexander Gelbukh

Identifying misogyny using artificial intelligence is a form of combating online toxicity against women.

Anime Popularity Prediction Before Huge Investments: a Multimodal Approach Using Deep Learning

no code implementations21 Jun 2024 Jesús Armenta-Segura, Grigori Sidorov

To measure the accuracy of the model, mean squared error (MSE) was used, obtaining a best result of 0. 011 when considering all inputs and the full version of the deep neural network, compared to the benchmark MSE 0. 412 obtained with traditional TF-IDF and PILtotensor vectorizations.

Analyzing Emotional Trends from X platform using SenticNet: A Comparative Analysis with Cryptocurrency Price

no code implementations6 May 2024 Moein Shahiki Tash, Zahra Ahani, Olga Kolesnikova, Grigori Sidorov

This study delves into the relationship between emotional trends from X platform data and the market dynamics of well-known cryptocurrencies Cardano, Binance, Fantom, Matic, and Ripple over the period from October 2022 to March 2023.

GuReT: Distinguishing Guilt and Regret related Text

no code implementations29 Jan 2024 Sabur Butt, Fazlourrahman Balouchzahi, Abdul Gafar Manuel Meque, Maaz Amjad, Hector G. Ceballos Cancino, Grigori Sidorov, Alexander Gelbukh

The intricate relationship between human decision-making and emotions, particularly guilt and regret, has significant implications on behavior and well-being.

Binary Classification Decision Making

Leveraging the power of transformers for guilt detection in text

no code implementations15 Jan 2024 Abdul Gafar Manuel Meque, Jason Angel, Grigori Sidorov, Alexander Gelbukh

In recent years, language models and deep learning techniques have revolutionized natural language processing tasks, including emotion detection.

SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish

no code implementations7 Nov 2023 Yevhen Kostiuk, Grigori Sidorov, Olga Kolesnikova

In this paper, we present a dataset of most frequent Spanish verb-noun collocations and sentences where they occur, each collocation is assigned to one of 37 lexical functions defined as classes for a hierarchical classification task.

Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts

no code implementations2 Jun 2023 Yevhen Kostiuk, Atnafu Lambebo Tonja, Grigori Sidorov, Olga Kolesnikova

In this paper, we investigate the issue of hate speech by presenting a novel task of translating hate speech into non-hate speech text while preserving its meaning.

Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models

no code implementations27 May 2023 Atnafu Lambebo Tonja, Hellina Hailu Nigatu, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh, Jugal Kalita

This paper describes CIC NLP's submission to the AmericasNLP 2023 Shared Task on machine translation systems for indigenous languages of the Americas.

Machine Translation Transfer Learning +1

Guilt Detection in Text: A Step Towards Understanding Complex Emotions

no code implementations6 Mar 2023 Abdul Gafar Manuel Meque, Nisar Hussain, Grigori Sidorov, Alexander Gelbukh

We introduce a novel Natural Language Processing (NLP) task called Guilt detection, which focuses on detecting guilt in text.

Sarcasm Detection Framework Using Context, Emotion and Sentiment Features

no code implementations23 Nov 2022 Oxana Vitman, Yevhen Kostiuk, Grigori Sidorov, Alexander Gelbukh

We use a pre-trained transformer and CNN to capture context features, and we use transformers pre-trained on emotions detection and sentiment analysis tasks.

Sarcasm Detection Sentiment Analysis

The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation

2 code implementations27 Oct 2022 Tadesse Destaw Belay, Atnafu Lambebo Tonja, Olga Kolesnikova, Seid Muhie Yimam, Abinew Ali Ayele, Silesh Bogale Haile, Grigori Sidorov, Alexander Gelbukh

Machine translation (MT) is one of the main tasks in natural language processing whose objective is to translate texts automatically from one natural language to another.

Machine Translation Sentence +1

PolyHope: Two-Level Hope Speech Detection from Tweets

no code implementations25 Oct 2022 Fazlourrahman Balouchzahi, Grigori Sidorov, Alexander Gelbukh

This strict annotation process resulted in promising performance for simple machine learning classifiers with only bi-grams; however, binary and multiclass hope speech detection results reveal that contextual embedding models have higher performance in this dataset.

Hope Speech Detection Vocal Bursts Valence Prediction

Mapping Process for the Task: Wikidata Statements to Text as Wikipedia Sentences

no code implementations23 Oct 2022 Hoang Thang Ta, Alexander Gelbukha, Grigori Sidorov

Acknowledged as one of the most successful online cooperative projects in human society, Wikipedia has obtained rapid growth in recent years and desires continuously to expand content and disseminate knowledge values for everyone globally.

Data-to-Text Generation Sentence

Overview of Abusive and Threatening Language Detection in Urdu at FIRE 2021

1 code implementation14 Jul 2022 Maaz Amjad, Alisa Zhila, Grigori Sidorov, Andrey Labunets, Sabur Butta, Hamza Imam Amjad, Oxana Vitman, Alexander Gelbukh

In this paper, we present two shared tasks of abusive and threatening language detection for the Urdu language which has more than 170 million speakers worldwide.

Abusive Language Binary Classification

UrduFake@FIRE2021: Shared Track on Fake News Identification in Urdu

no code implementations11 Jul 2022 Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh

This study reports the second shared task named as UrduFake@FIRE2021 on identifying fake news detection in Urdu language.

Binary Classification Fake News Detection

Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2021

no code implementations11 Jul 2022 Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Alisa Zhila, Grigori Sidorov, Alexander Gelbukh

Admittedly, while training sets from the past and the current years overlap to a large extent, the testing set provided this year is completely different.

Binary Classification Fake News Detection

What goes on inside rumour and non-rumour tweets and their reactions: A Psycholinguistic Analyses

no code implementations9 Nov 2021 Sabur Butt, Shakshi Sharma, Rajesh Sharma, Grigori Sidorov, Alexander Gelbukh

In the descriptive line of works, where researchers have tried to analyse rumours using NLP approaches, there isnt much emphasis on psycho-linguistics analyses of social media text.

Descriptive Misinformation +1

Data Augmentation using Machine Translation for Fake News Detection in the Urdu Language

no code implementations LREC 2020 Maaz Amjad, Grigori Sidorov, Alisa Zhila

As the fake news phenomenon is omnipresent across all languages, it is crucial to be able to efficiently solve this problem for languages other than English.

Data Augmentation Fake News Detection +2

The Role of Emotions in Native Language Identification

no code implementations WS 2018 Ilia Markov, Vivi Nastase, Carlo Strapparava, Grigori Sidorov

We explore the hypothesis that emotion is one of the dimensions of language that surfaces from the native language into a second language.

Deception Detection Native Language Identification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.