Search Results for author: Hansi Hettiarachchi

Found 17 papers, 8 papers with code

DAAI at CASE 2021 Task 1: Transformer-based Multilingual Socio-political and Crisis Event Detection

no code implementations • ACL (CASE) 2021 • Hansi Hettiarachchi, Mariam Adedoyin-Olowe, Jagdev Bhogal, Mohamed Medhat Gaber

Automatic socio-political and crisis event detection has been a challenge for natural language processing as well as social and political science communities, due to the diversity and nuance in such events and high accuracy requirements.

Event Detection Sentence

Paper
Add Code

Discovering Black Lives Matter Events in the United States: Shared Task 3, CASE 2021

no code implementations • ACL (CASE) 2021 • Salvatore Giorgi, Vanni Zavarella, Hristo Tanev, Nicolas Stefanovitch, Sy Hwang, Hansi Hettiarachchi, Tharindu Ranasinghe, Vivek Kalyan, Paul Tan, Shaun Tan, Martin Andrews, Tiancheng Hu, Niklas Stoehr, Francesco Ignazio Re, Daniel Vegh, Dennis Atzenhofer, Brenda Curtis, Ali Hürriyetoğlu

Evaluating the state-of-the-art event detection systems on determining spatio-temporal distribution of the events on the ground is performed unfrequently.

Event Detection

Paper
Add Code

Can Multilingual Transformers Fight the COVID-19 Infodemic?

no code implementations • RANLP 2021 • Lasitha Uyangodage, Tharindu Ranasinghe, Hansi Hettiarachchi

False information detection has thus become a surging research topic in recent months.

BIG-bench Machine Learning

Paper
Add Code

NSINA: A News Corpus for Sinhala

4 code implementations • 25 Mar 2024 • Hansi Hettiarachchi, Damith Premasiri, Lasitha Uyangodage, Tharindu Ranasinghe

NSINA is the largest news corpus for Sinhala, available up to date.

Benchmarking Headline Generation

Paper
Code

CODE-ACCORD: A Corpus of Building Regulatory Data for Rule Generation towards Automatic Compliance Checking

1 code implementation • 4 Mar 2024 • Hansi Hettiarachchi, Amna Dridi, Mohamed Medhat Gaber, Pouyan Parsafard, Nicoleta Bocaneala, Katja Breitenfelder, Gonçal Costa, Maria Hedblom, Mihaela Juganaru-Mathieu, Thamer Mecharnia, Sumee Park, He Tan, Abdel-Rahman H. Tawil, Edlira Vakaj

CODE-ACCORD comprises 862 self-contained sentences extracted from the building regulations of England and Finland.

Relation Extraction Sentence +2

Paper
Code

SOLD: Sinhala Offensive Language Dataset

1 code implementation • 1 Dec 2022 • Tharindu Ranasinghe, Isuri Anuradha, Damith Premasiri, Kanishka Silva, Hansi Hettiarachchi, Lasitha Uyangodage, Marcos Zampieri

SOLD is a manually annotated dataset containing 10, 000 posts from Twitter annotated as offensive and not offensive at both sentence-level and token-level, improving the explainability of the ML models.

Language Identification Sentence

Paper
Code

Event Causality Identification with Causal News Corpus -- Shared Task 3, CASE 2022

no code implementations • 22 Nov 2022 • Fiona Anting Tan, Hansi Hettiarachchi, Ali Hürriyetoğlu, Tommaso Caselli, Onur Uca, Farhana Ferdousi Liza, Nelleke Oostdijk

The best F1 scores achieved for Subtask 1 and 2 were 86. 19% and 54. 15%, respectively.

Binary Classification Event Causality Identification +1

Paper
Add Code

Extended Multilingual Protest News Detection -- Shared Task 1, CASE 2021 and 2022

no code implementations • 21 Nov 2022 • Ali Hürriyetoğlu, Osman Mutlu, Fırat Duruşan, Onur Uca, Alaeddin Selçuk Gürel, Benjamin Radford, Yaoyao Dai, Hansi Hettiarachchi, Niklas Stoehr, Tadashi Nomoto, Milena Slavcheva, Francielle Vargas, Aaqib Javid, Fatih Beyhan, Erdem Yörük

The CASE 2022 extension consists of expanding the test data with more data in previously available languages, namely, English, Hindi, Portuguese, and Spanish, and adding new test data in Mandarin, Turkish, and Urdu for Sub-task 1, document classification.

Document Classification Event Detection +3

Paper
Add Code

The Causal News Corpus: Annotating Causal Relations in Event Sentences from News

1 code implementation • LREC 2022 • Fiona Anting Tan, Ali Hürriyetoğlu, Tommaso Caselli, Nelleke Oostdijk, Tadashi Nomoto, Hansi Hettiarachchi, Iqra Ameer, Onur Uca, Farhana Ferdousi Liza, Tiancheng Hu

Leveraging each of these external datasets for training, we achieved up to approximately 64% F1 on the CNC test set without additional fine-tuning.

Language Modelling

Paper
Code

Transformers to Fight the COVID-19 Infodemic

1 code implementation • NAACL (NLP4IF) 2021 • Lasitha Uyangodage, Tharindu Ranasinghe, Hansi Hettiarachchi

NLP4IF-2021 shared task on fighting the COVID-19 infodemic has been organised to strengthen the research in false information detection where the participants are asked to predict seven different binary labels regarding false information in a tweet.

Paper
Code

TransWiC at SemEval-2021 Task 2: Transformer-based Multilingual and Cross-lingual Word-in-Context Disambiguation

no code implementations • SEMEVAL 2021 • Hansi Hettiarachchi, Tharindu Ranasinghe

Identifying whether a word carries the same meaning or different meaning in two contexts is an important research area in natural language processing which plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction.

Information Retrieval Question Answering +2

Paper
Add Code

BRUMS at SemEval-2020 Task 12: Transformer Based Multilingual Offensive Language Identification in Social Media

no code implementations • SEMEVAL 2020 • Tharindu Ranasinghe, Hansi Hettiarachchi

In this paper, we describe the team \textit{BRUMS} entry to OffensEval 2: Multilingual Offensive Language Identification in Social Media in SemEval-2020.

Language Identification

Paper
Add Code

BRUMS at SemEval-2020 Task 3: Contextualised Embeddings for Predicting the (Graded) Effect of Context in Word Similarity

1 code implementation • SEMEVAL 2020 • Hansi Hettiarachchi, Tharindu Ranasinghe

This paper presents the team BRUMS submission to SemEval-2020 Task 3: Graded Word Similarity in Context.

Position Word Embeddings +1

Paper
Code

BRUMS at SemEval-2020 Task 12 : Transformer based Multilingual Offensive Language Identification in Social Media

no code implementations • 13 Oct 2020 • Tharindu Ranasinghe, Hansi Hettiarachchi

In this paper, we describe the team \textit{BRUMS} entry to OffensEval 2: Multilingual Offensive Language Identification in Social Media in SemEval-2020.

Language Identification

Paper
Add Code

InfoMiner at WNUT-2020 Task 2: Transformer-based Covid-19 Informative Tweet Extraction

1 code implementation • EMNLP (WNUT) 2020 • Hansi Hettiarachchi, Tharindu Ranasinghe

Identifying informative tweets is an important step when building information extraction systems based on social media.

Task 2

Paper
Code

Embed2Detect: Temporally Clustered Embedded Words for Event Detection in Social Media

2 code implementations • 10 Jun 2020 • Hansi Hettiarachchi, Mariam Adedoyin-Olowe, Jagdev Bhogal, Mohamed Medhat Gaber

Social media is becoming a primary medium to discuss what is happening around the world.

Clustering Event Detection +3

Paper
Code

Emoji Powered Capsule Network to Detect Type and Target of Offensive Posts in Social Media

no code implementations • RANLP 2019 • Hansi Hettiarachchi, Tharindu Ranasinghe

This paper describes a novel research approach to detect type and target of offensive posts in social media using a capsule network.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.