Search Results for author: Grigori Sidorov

Found 37 papers, 3 papers with code

MUCIC@LT-EDI-ACL2022: Hope Speech Detection using Data Re-Sampling and 1D Conv-LSTM

no code implementations • LTEDI (ACL) 2022 • Anusha Gowda, Fazlourrahman Balouchzahi, Hosahalli Shashirekha, Grigori Sidorov

Spreading positive vibes or hope content on social media may help many people to get motivated in their life.

Hope Speech Detection

Paper
Add Code

MUCIC@TamilNLP-ACL2022: Abusive Comment Detection in Tamil Language using 1D Conv-LSTM

1 code implementation • DravidianLangTech (ACL) 2022 • Fazlourrahman Balouchzahi, Anusha Gowda, Hosahalli Shashirekha, Grigori Sidorov

To address the automatic detection of abusive languages in online platforms, this paper describes the models submitted by our team - MUCIC to the shared task on “Abusive Comment Detection in Tamil-ACL 2022”.

Abusive Language

Paper
Code

The IPN-CIC team system submission for the WMT 2020 similar language task

no code implementations • WMT (EMNLP) 2020 • Luis A. Menéndez-Salazar, Grigori Sidorov, Marta R. Costa-jussà

This paper describes the participation of the NLP research team of the IPN Computer Research center in the WMT 2020 Similar Language Translation Task.

Domain Adaptation Translation

Paper
Add Code

CIC NLP at SMM4H 2022: a BERT-based approach for classification of social media forum posts

no code implementations • SMM4H (COLING) 2022 • Atnafu Lambebo Tonja, Olumide Ebenezer Ojo, Mohammed Arif Khan, Abdul Gafar Manuel Meque, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh

This paper describes our submissions for the Social Media Mining for Health (SMM4H) 2022 shared tasks.

Classification

Paper
Add Code

CIC@LT-EDI-ACL2022: Are transformers the only hope? Hope speech detection for Spanish and English comments

no code implementations • LTEDI (ACL) 2022 • Fazlourrahman Balouchzahi, Sabur Butt, Grigori Sidorov, Alexander Gelbukh

Hope is an inherent part of human life and essential for improving the quality of life.

Hope Speech Detection

Paper
Add Code

MUCIC at ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Using N-grams and Multilingual Sentence Encoders

no code implementations • ICON 2021 • Fazlourrahman Balouchzahi, Oxana Vitman, Hosahalli Lakshmaiah Shashirekha, Grigori Sidorov, Alexander Gelbukh

These approaches obtained the highest performance in the shared task for Meitei, Bangla, and Multilingual texts with instance-F1 scores of 0. 350, 0. 412, and 0. 380 respectively using Pre-aggregation of labels.

Blocking Language Identification +4

Paper
Add Code

GuReT: Distinguishing Guilt and Regret related Text

no code implementations • 29 Jan 2024 • Sabur Butt, Fazlourrahman Balouchzahi, Abdul Gafar Manuel Meque, Maaz Amjad, Hector G. Ceballos Cancino, Grigori Sidorov, Alexander Gelbukh

The intricate relationship between human decision-making and emotions, particularly guilt and regret, has significant implications on behavior and well-being.

Binary Classification Decision Making

Paper
Add Code

Leveraging the power of transformers for guilt detection in text

no code implementations • 15 Jan 2024 • Abdul Gafar Manuel Meque, Jason Angel, Grigori Sidorov, Alexander Gelbukh

In recent years, language models and deep learning techniques have revolutionized natural language processing tasks, including emotion detection.

Paper
Add Code

SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish

no code implementations • 7 Nov 2023 • Yevhen Kostiuk, Grigori Sidorov, Olga Kolesnikova

In this paper, we present a dataset of most frequent Spanish verb-noun collocations and sentences where they occur, each collocation is assigned to one of 37 lexical functions defined as classes for a hierarchical classification task.

Paper
Add Code

Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts

no code implementations • 2 Jun 2023 • Yevhen Kostiuk, Atnafu Lambebo Tonja, Grigori Sidorov, Olga Kolesnikova

In this paper, we investigate the issue of hate speech by presenting a novel task of translating hate speech into non-hate speech text while preserving its meaning.

Paper
Add Code

Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models

no code implementations • 27 May 2023 • Atnafu Lambebo Tonja, Hellina Hailu Nigatu, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh, Jugal Kalita

This paper describes CIC NLP's submission to the AmericasNLP 2023 Shared Task on machine translation systems for indigenous languages of the Americas.

Machine Translation Transfer Learning +1

Paper
Add Code

Parallel Corpus for Indigenous Language Translation: Spanish-Mazatec and Spanish-Mixtec

no code implementations • 27 May 2023 • Atnafu Lambebo Tonja, Christian Maldonado-Sifuentes, David Alejandro Mendoza Castillo, Olga Kolesnikova, Noé Castro-Sánchez, Grigori Sidorov, Alexander Gelbukh

In this paper, we present a parallel Spanish-Mazatec and Spanish-Mixtec corpus for machine translation (MT) tasks, where Mazatec and Mixtec are two indigenous Mexican languages.

Few-Shot Learning Machine Translation +2

Paper
Add Code

Transformer-based approaches to Sentiment Detection

no code implementations • 13 Mar 2023 • Olumide Ebenezer Ojo, Hoang Thang Ta, Alexander Gelbukh, Hiram Calvo, Olaronke Oluwayemisi Adebanji, Grigori Sidorov

The performance of the four models that were used to detect disaster in the text was compared.

text-classification Text Classification +1

Paper
Add Code

Guilt Detection in Text: A Step Towards Understanding Complex Emotions

no code implementations • 6 Mar 2023 • Abdul Gafar Manuel Meque, Nisar Hussain, Grigori Sidorov, Alexander Gelbukh

We introduce a novel Natural Language Processing (NLP) task called Guilt detection, which focuses on detecting guilt in text.

Paper
Add Code

ReDDIT: Regret Detection and Domain Identification from Text

no code implementations • 14 Dec 2022 • Fazlourrahman Balouchzahi, Sabur Butt, Grigori Sidorov, Alexander Gelbukh

In this paper, we present a study of regret and its expression on social media platforms.

Word Embeddings

Paper
Add Code

Transformer-based Model for Word Level Language Identification in Code-mixed Kannada-English Texts

no code implementations • 26 Nov 2022 • Atnafu Lambebo Tonja, Mesay Gemeda Yigezu, Olga Kolesnikova, Moein Shahiki Tash, Grigori Sidorov, Alexander Gelbuk

Using code-mixed data in natural language processing (NLP) research currently gets a lot of attention.

Language Identification

Paper
Add Code

Sarcasm Detection Framework Using Context, Emotion and Sentiment Features

no code implementations • 23 Nov 2022 • Oxana Vitman, Yevhen Kostiuk, Grigori Sidorov, Alexander Gelbukh

We use a pre-trained transformer and CNN to capture context features, and we use transformers pre-trained on emotions detection and sentiment analysis tasks.

Sarcasm Detection Sentiment Analysis

Paper
Add Code

The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation

2 code implementations • 27 Oct 2022 • Tadesse Destaw Belay, Atnafu Lambebo Tonja, Olga Kolesnikova, Seid Muhie Yimam, Abinew Ali Ayele, Silesh Bogale Haile, Grigori Sidorov, Alexander Gelbukh

Machine translation (MT) is one of the main tasks in natural language processing whose objective is to translate texts automatically from one natural language to another.

Machine Translation Sentence +1

Paper
Code

PolyHope: Two-Level Hope Speech Detection from Tweets

no code implementations • 25 Oct 2022 • Fazlourrahman Balouchzahi, Grigori Sidorov, Alexander Gelbukh

This strict annotation process resulted in promising performance for simple machine learning classifiers with only bi-grams; however, binary and multiclass hope speech detection results reveal that contextual embedding models have higher performance in this dataset.

Hope Speech Detection Vocal Bursts Valence Prediction

Paper
Add Code

Mapping Process for the Task: Wikidata Statements to Text as Wikipedia Sentences

no code implementations • 23 Oct 2022 • Hoang Thang Ta, Alexander Gelbukha, Grigori Sidorov

Acknowledged as one of the most successful online cooperative projects in human society, Wikipedia has obtained rapid growth in recent years and desires continuously to expand content and disseminate knowledge values for everyone globally.

Data-to-Text Generation Sentence

Paper
Add Code

UrduFake@FIRE2020: Shared Track on Fake News Identification in Urdu

no code implementations • 25 Jul 2022 • Maaz Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh, Paolo Rosso

This paper gives the overview of the first shared task at FIRE 2020 on fake news detection in the Urdu language.

BIG-bench Machine Learning Binary Classification +1

Paper
Add Code

Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2020

no code implementations • 25 Jul 2022 • Maaz Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh, Paolo Rosso

This overview paper describes the first shared task on fake news detection in Urdu language.

BIG-bench Machine Learning Binary Classification +1

Paper
Add Code

Overview of Abusive and Threatening Language Detection in Urdu at FIRE 2021

1 code implementation • 14 Jul 2022 • Maaz Amjad, Alisa Zhila, Grigori Sidorov, Andrey Labunets, Sabur Butta, Hamza Imam Amjad, Oxana Vitman, Alexander Gelbukh

In this paper, we present two shared tasks of abusive and threatening language detection for the Urdu language which has more than 170 million speakers worldwide.

Abusive Language Binary Classification

Paper
Code

UrduFake@FIRE2021: Shared Track on Fake News Identification in Urdu

no code implementations • 11 Jul 2022 • Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh

This study reports the second shared task named as UrduFake@FIRE2021 on identifying fake news detection in Urdu language.

Binary Classification Fake News Detection

Paper
Add Code

Overview of the Shared Task on Fake News Detection in Urdu at FIRE 2021

no code implementations • 11 Jul 2022 • Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Alisa Zhila, Grigori Sidorov, Alexander Gelbukh

Admittedly, while training sets from the past and the current years overlap to a large extent, the testing set provided this year is completely different.

Binary Classification Fake News Detection

Paper
Add Code

Mental Illness Classification on Social Media Texts using Deep Learning and Transfer Learning

no code implementations • 3 Jul 2022 • Iqra Ameer, Muhammad Arif, Grigori Sidorov, Helena Gòmez-Adorno, Alexander Gelbukh

According to the World health organization (WHO), approximately 450 million people are affected.

Decision Making Transfer Learning

Paper
Add Code

Job Offers Classifier using Neural Networks and Oversampling Methods

no code implementations • 3 Jul 2022 • Germán Ortiz, Gemma Bel Enguix, Helena Gómez-Adorno, Iqra Ameer, Grigori Sidorov

Both policy and research benefit from a better understanding of individuals' jobs.

Management Marketing

Paper
Add Code

What goes on inside rumour and non-rumour tweets and their reactions: A Psycholinguistic Analyses

no code implementations • 9 Nov 2021 • Sabur Butt, Shakshi Sharma, Rajesh Sharma, Grigori Sidorov, Alexander Gelbukh

In the descriptive line of works, where researchers have tried to analyse rumours using NLP approaches, there isnt much emphasis on psycho-linguistics analyses of social media text.

Descriptive Misinformation +1

Paper
Add Code

Data Augmentation using Machine Translation for Fake News Detection in the Urdu Language

no code implementations • LREC 2020 • Maaz Amjad, Grigori Sidorov, Alisa Zhila

As the fake news phenomenon is omnipresent across all languages, it is crucial to be able to efficiently solve this problem for languages other than English.

Data Augmentation Fake News Detection +2

Paper
Add Code

CIC at SemEval-2019 Task 5: Simple Yet Very Efficient Approach to Hate Speech Detection, Aggressive Behavior Detection, and Target Classification in Twitter

no code implementations • SEMEVAL 2019 • Iqra Ameer, Muhammad Hammad Fahim Siddiqui, Grigori Sidorov, Alex Gelbukh, er

The goal of this paper is to detect (A) Hate speech against immigrants and women, (B) Aggressive behavior and target classification, both for English and Spanish.

Hate Speech Detection