no code implementations • WS 2017 • Helena Gomez, Ilia Markov, Jorge Baptista, Grigori Sidorov, David Pinto
This year{'}s task aims at identifying 14 languages across 6 language groups using a corpus of excerpts of journalistic texts.
no code implementations • WS 2017 • Ilia Markov, Lingzhen Chen, Carlo Strapparava, Grigori Sidorov
We present the CIC-FBK system, which took part in the Native Language Identification (NLI) Shared Task 2017.
no code implementations • 17 Oct 2017 • Ignacio Arroyo-Fernández, Carlos-Francisco Méndez-Cruz, Gerardo Sierra, Juan-Manuel Torres-Moreno, Grigori Sidorov
Results showed that our model outperformed the state of the art in well-known Semantic Textual Similarity (STS) benchmarks.
Open-Ended Question Answering Semantic Textual Similarity +3
no code implementations • WS 2018 • Ilia Markov, Vivi Nastase, Carlo Strapparava, Grigori Sidorov
We explore the hypothesis that emotion is one of the dimensions of language that surfaces from the native language into a second language.
no code implementations • SEMEVAL 2019 • Iqra Ameer, Muhammad Hammad Fahim Siddiqui, Grigori Sidorov, Alex Gelbukh, er
The goal of this paper is to detect (A) Hate speech against immigrants and women, (B) Aggressive behavior and target classification, both for English and Spanish.
no code implementations • LREC 2020 • Maaz Amjad, Grigori Sidorov, Alisa Zhila
As the fake news phenomenon is omnipresent across all languages, it is crucial to be able to efficiently solve this problem for languages other than English.
no code implementations • 9 Nov 2021 • Sabur Butt, Shakshi Sharma, Rajesh Sharma, Grigori Sidorov, Alexander Gelbukh
In the descriptive line of works, where researchers have tried to analyse rumours using NLP approaches, there isnt much emphasis on psycho-linguistics analyses of social media text.
no code implementations • 3 Jul 2022 • Iqra Ameer, Muhammad Arif, Grigori Sidorov, Helena Gòmez-Adorno, Alexander Gelbukh
According to the World health organization (WHO), approximately 450 million people are affected.
no code implementations • 3 Jul 2022 • Germán Ortiz, Gemma Bel Enguix, Helena Gómez-Adorno, Iqra Ameer, Grigori Sidorov
Both policy and research benefit from a better understanding of individuals' jobs.
no code implementations • 11 Jul 2022 • Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Alisa Zhila, Grigori Sidorov, Alexander Gelbukh
Admittedly, while training sets from the past and the current years overlap to a large extent, the testing set provided this year is completely different.
no code implementations • 11 Jul 2022 • Maaz Amjad, Sabur Butt, Hamza Imam Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh
This study reports the second shared task named as UrduFake@FIRE2021 on identifying fake news detection in Urdu language.
1 code implementation • 14 Jul 2022 • Maaz Amjad, Alisa Zhila, Grigori Sidorov, Andrey Labunets, Sabur Butta, Hamza Imam Amjad, Oxana Vitman, Alexander Gelbukh
In this paper, we present two shared tasks of abusive and threatening language detection for the Urdu language which has more than 170 million speakers worldwide.
no code implementations • 25 Jul 2022 • Maaz Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh, Paolo Rosso
This overview paper describes the first shared task on fake news detection in Urdu language.
no code implementations • 25 Jul 2022 • Maaz Amjad, Grigori Sidorov, Alisa Zhila, Alexander Gelbukh, Paolo Rosso
This paper gives the overview of the first shared task at FIRE 2020 on fake news detection in the Urdu language.
no code implementations • 23 Oct 2022 • Hoang Thang Ta, Alexander Gelbukha, Grigori Sidorov
Acknowledged as one of the most successful online cooperative projects in human society, Wikipedia has obtained rapid growth in recent years and desires continuously to expand content and disseminate knowledge values for everyone globally.
no code implementations • 25 Oct 2022 • Fazlourrahman Balouchzahi, Grigori Sidorov, Alexander Gelbukh
This strict annotation process resulted in promising performance for simple machine learning classifiers with only bi-grams; however, binary and multiclass hope speech detection results reveal that contextual embedding models have higher performance in this dataset.
2 code implementations • 27 Oct 2022 • Tadesse Destaw Belay, Atnafu Lambebo Tonja, Olga Kolesnikova, Seid Muhie Yimam, Abinew Ali Ayele, Silesh Bogale Haile, Grigori Sidorov, Alexander Gelbukh
Machine translation (MT) is one of the main tasks in natural language processing whose objective is to translate texts automatically from one natural language to another.
no code implementations • 23 Nov 2022 • Oxana Vitman, Yevhen Kostiuk, Grigori Sidorov, Alexander Gelbukh
We use a pre-trained transformer and CNN to capture context features, and we use transformers pre-trained on emotions detection and sentiment analysis tasks.
no code implementations • 26 Nov 2022 • Atnafu Lambebo Tonja, Mesay Gemeda Yigezu, Olga Kolesnikova, Moein Shahiki Tash, Grigori Sidorov, Alexander Gelbuk
Using code-mixed data in natural language processing (NLP) research currently gets a lot of attention.
no code implementations • 14 Dec 2022 • Fazlourrahman Balouchzahi, Sabur Butt, Grigori Sidorov, Alexander Gelbukh
In this paper, we present a study of regret and its expression on social media platforms.
no code implementations • 6 Mar 2023 • Abdul Gafar Manuel Meque, Nisar Hussain, Grigori Sidorov, Alexander Gelbukh
We introduce a novel Natural Language Processing (NLP) task called Guilt detection, which focuses on detecting guilt in text.
no code implementations • 13 Mar 2023 • Olumide Ebenezer Ojo, Hoang Thang Ta, Alexander Gelbukh, Hiram Calvo, Olaronke Oluwayemisi Adebanji, Grigori Sidorov
The performance of the four models that were used to detect disaster in the text was compared.
no code implementations • 27 May 2023 • Atnafu Lambebo Tonja, Christian Maldonado-Sifuentes, David Alejandro Mendoza Castillo, Olga Kolesnikova, Noé Castro-Sánchez, Grigori Sidorov, Alexander Gelbukh
In this paper, we present a parallel Spanish-Mazatec and Spanish-Mixtec corpus for machine translation (MT) tasks, where Mazatec and Mixtec are two indigenous Mexican languages.
no code implementations • 27 May 2023 • Atnafu Lambebo Tonja, Hellina Hailu Nigatu, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh, Jugal Kalita
This paper describes CIC NLP's submission to the AmericasNLP 2023 Shared Task on machine translation systems for indigenous languages of the Americas.
no code implementations • 2 Jun 2023 • Yevhen Kostiuk, Atnafu Lambebo Tonja, Grigori Sidorov, Olga Kolesnikova
In this paper, we investigate the issue of hate speech by presenting a novel task of translating hate speech into non-hate speech text while preserving its meaning.
no code implementations • 7 Nov 2023 • Yevhen Kostiuk, Grigori Sidorov, Olga Kolesnikova
In this paper, we present a dataset of most frequent Spanish verb-noun collocations and sentences where they occur, each collocation is assigned to one of 37 lexical functions defined as classes for a hierarchical classification task.
no code implementations • 15 Jan 2024 • Abdul Gafar Manuel Meque, Jason Angel, Grigori Sidorov, Alexander Gelbukh
In recent years, language models and deep learning techniques have revolutionized natural language processing tasks, including emotion detection.
no code implementations • 29 Jan 2024 • Sabur Butt, Fazlourrahman Balouchzahi, Abdul Gafar Manuel Meque, Maaz Amjad, Hector G. Ceballos Cancino, Grigori Sidorov, Alexander Gelbukh
The intricate relationship between human decision-making and emotions, particularly guilt and regret, has significant implications on behavior and well-being.
no code implementations • LTEDI (ACL) 2022 • Anusha Gowda, Fazlourrahman Balouchzahi, Hosahalli Shashirekha, Grigori Sidorov
Spreading positive vibes or hope content on social media may help many people to get motivated in their life.
no code implementations • LTEDI (ACL) 2022 • Fazlourrahman Balouchzahi, Sabur Butt, Grigori Sidorov, Alexander Gelbukh
Hope is an inherent part of human life and essential for improving the quality of life.
no code implementations • ICON 2021 • Fazlourrahman Balouchzahi, Oxana Vitman, Hosahalli Lakshmaiah Shashirekha, Grigori Sidorov, Alexander Gelbukh
These approaches obtained the highest performance in the shared task for Meitei, Bangla, and Multilingual texts with instance-F1 scores of 0. 350, 0. 412, and 0. 380 respectively using Pre-aggregation of labels.
1 code implementation • DravidianLangTech (ACL) 2022 • Fazlourrahman Balouchzahi, Anusha Gowda, Hosahalli Shashirekha, Grigori Sidorov
To address the automatic detection of abusive languages in online platforms, this paper describes the models submitted by our team - MUCIC to the shared task on “Abusive Comment Detection in Tamil-ACL 2022”.
no code implementations • WMT (EMNLP) 2020 • Luis A. Menéndez-Salazar, Grigori Sidorov, Marta R. Costa-jussà
This paper describes the participation of the NLP research team of the IPN Computer Research center in the WMT 2020 Similar Language Translation Task.
no code implementations • SMM4H (COLING) 2022 • Atnafu Lambebo Tonja, Olumide Ebenezer Ojo, Mohammed Arif Khan, Abdul Gafar Manuel Meque, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh
This paper describes our submissions for the Social Media Mining for Health (SMM4H) 2022 shared tasks.