no code implementations • ACL (CASE) 2021 • Hansi Hettiarachchi, Mariam Adedoyin-Olowe, Jagdev Bhogal, Mohamed Medhat Gaber
Automatic socio-political and crisis event detection has been a challenge for natural language processing as well as social and political science communities, due to the diversity and nuance in such events and high accuracy requirements.
no code implementations • ACL (CASE) 2021 • Salvatore Giorgi, Vanni Zavarella, Hristo Tanev, Nicolas Stefanovitch, Sy Hwang, Hansi Hettiarachchi, Tharindu Ranasinghe, Vivek Kalyan, Paul Tan, Shaun Tan, Martin Andrews, Tiancheng Hu, Niklas Stoehr, Francesco Ignazio Re, Daniel Vegh, Dennis Atzenhofer, Brenda Curtis, Ali Hürriyetoğlu
State-of-the-art event detection systems are infrequently evaluated on how well they determine the spatio-temporal distribution of events on the ground.
no code implementations • RANLP 2021 • Lasitha Uyangodage, Tharindu Ranasinghe, Hansi Hettiarachchi
False information detection has become a surging research topic in recent months.
4 code implementations • 25 Mar 2024 • Hansi Hettiarachchi, Damith Premasiri, Lasitha Uyangodage, Tharindu Ranasinghe
NSINA is the largest news corpus for Sinhala available to date.
1 code implementation • 4 Mar 2024 • Hansi Hettiarachchi, Amna Dridi, Mohamed Medhat Gaber, Pouyan Parsafard, Nicoleta Bocaneala, Katja Breitenfelder, Gonçal Costa, Maria Hedblom, Mihaela Juganaru-Mathieu, Thamer Mecharnia, Sumee Park, He Tan, Abdel-Rahman H. Tawil, Edlira Vakaj
CODE-ACCORD comprises 862 self-contained sentences extracted from the building regulations of England and Finland.
1 code implementation • 1 Dec 2022 • Tharindu Ranasinghe, Isuri Anuradha, Damith Premasiri, Kanishka Silva, Hansi Hettiarachchi, Lasitha Uyangodage, Marcos Zampieri
SOLD is a manually annotated dataset containing 10,000 posts from Twitter annotated as offensive or not offensive at both the sentence level and the token level, improving the explainability of the ML models.
no code implementations • 22 Nov 2022 • Fiona Anting Tan, Hansi Hettiarachchi, Ali Hürriyetoğlu, Tommaso Caselli, Onur Uca, Farhana Ferdousi Liza, Nelleke Oostdijk
The best F1 scores achieved for Subtask 1 and 2 were 86.19% and 54.15%, respectively.
no code implementations • 21 Nov 2022 • Ali Hürriyetoğlu, Osman Mutlu, Fırat Duruşan, Onur Uca, Alaeddin Selçuk Gürel, Benjamin Radford, Yaoyao Dai, Hansi Hettiarachchi, Niklas Stoehr, Tadashi Nomoto, Milena Slavcheva, Francielle Vargas, Aaqib Javid, Fatih Beyhan, Erdem Yörük
The CASE 2022 extension consists of expanding the test data with more data in previously available languages, namely, English, Hindi, Portuguese, and Spanish, and adding new test data in Mandarin, Turkish, and Urdu for Sub-task 1, document classification.
1 code implementation • LREC 2022 • Fiona Anting Tan, Ali Hürriyetoğlu, Tommaso Caselli, Nelleke Oostdijk, Tadashi Nomoto, Hansi Hettiarachchi, Iqra Ameer, Onur Uca, Farhana Ferdousi Liza, Tiancheng Hu
Leveraging each of these external datasets for training, we achieved up to approximately 64% F1 on the CNC test set without additional fine-tuning.
1 code implementation • NAACL (NLP4IF) 2021 • Lasitha Uyangodage, Tharindu Ranasinghe, Hansi Hettiarachchi
The NLP4IF-2021 shared task on fighting the COVID-19 infodemic was organised to strengthen research in false information detection; participants are asked to predict seven different binary labels regarding false information in a tweet.
no code implementations • SEMEVAL 2021 • Hansi Hettiarachchi, Tharindu Ranasinghe
Identifying whether a word carries the same or a different meaning in two contexts is an important research area in natural language processing, and it plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.
no code implementations • SEMEVAL 2020 • Tharindu Ranasinghe, Hansi Hettiarachchi
In this paper, we describe the team BRUMS entry to OffensEval 2: Multilingual Offensive Language Identification in Social Media in SemEval-2020.
1 code implementation • SEMEVAL 2020 • Hansi Hettiarachchi, Tharindu Ranasinghe
This paper presents the team BRUMS submission to SemEval-2020 Task 3: Graded Word Similarity in Context.
no code implementations • 13 Oct 2020 • Tharindu Ranasinghe, Hansi Hettiarachchi
In this paper, we describe the team BRUMS entry to OffensEval 2: Multilingual Offensive Language Identification in Social Media in SemEval-2020.
1 code implementation • EMNLP (WNUT) 2020 • Hansi Hettiarachchi, Tharindu Ranasinghe
Identifying informative tweets is an important step when building information extraction systems based on social media.
2 code implementations • 10 Jun 2020 • Hansi Hettiarachchi, Mariam Adedoyin-Olowe, Jagdev Bhogal, Mohamed Medhat Gaber
Social media is becoming a primary medium to discuss what is happening around the world.
no code implementations • RANLP 2019 • Hansi Hettiarachchi, Tharindu Ranasinghe
This paper describes a novel research approach to detect the type and target of offensive posts in social media using a capsule network.