no code implementations • COLING (PEOPLES) 2020 • Boris Marinov, Jennifer Spenader, Tommaso Caselli
The paper focuses on a large collection of Dutch tweets from the Netherlands to get an insight into the perception and reactions of users during the early months of the COVID-19 pandemic.
no code implementations • ACL (LChange) 2021 • Pierpaolo Basile, Annalina Caputo, Tommaso Caselli, Pierluigi Cassotti, Rossella Varvara
The use of automatic methods for the study of lexical semantic change (LSC) has led to the creation of evaluation benchmarks.
1 code implementation • COLING 2022 • Tommaso Caselli, Irene Dini, Felice Dell’Orletta
This paper presents a comprehensive set of probing experiments using a multilingual language model, XLM-R, for temporal relation classification between events in four languages.
no code implementations • SemEval (NAACL) 2022 • Wessel Poelman, Gijs Danoe, Esther Ploeger, Frank van den Berg, Tommaso Caselli, Lukas Edman
This paper describes our system created for the SemEval 2022 Task 3: Presupposed Taxonomies - Evaluating Neural-network Semantics.
1 code implementation • EMNLP (WNUT) 2021 • Rob van der Goot, Alan Ramponi, Arkaitz Zubiaga, Barbara Plank, Benjamin Muller, Iñaki San Vicente Roncal, Nikola Ljubešić, Özlem Çetinoğlu, Rahmad Mahendra, Talha Çolakoğlu, Timothy Baldwin, Tommaso Caselli, Wladimir Sidorenko
This task is beneficial for downstream analysis, as it provides a way to harmonize (often spontaneous) linguistic variation.
no code implementations • ACL (CASE) 2021 • Tommaso Caselli, Osman Mutlu, Angelo Basile, Ali Hürriyetoğlu
We analyze the effect of further retraining BERT with different domain specific data as an unsupervised domain adaptation strategy for event extraction.
no code implementations • ACL (NLP4PosImpact) 2021 • Tommaso Caselli, Roberto Cibin, Costanza Conforti, Enrique Encinas, Maurizio Teli
We introduce 9 guiding principles to integrate Participatory Design (PD) methods in the development of Natural Language Processing (NLP) systems.
1 code implementation • NAACL (WOAH) 2022 • Ward Ruitenbeek, Victor Zwart, Robin Van Der Noord, Zhenja Gnezdilov, Tommaso Caselli
This paper presents a comprehensive corpus for the study of socially unacceptable language in Dutch.
1 code implementation • ACL (WOAH) 2021 • Tommaso Caselli, Arjan Schelhaas, Marieke Weultjes, Folkert Leistra, Hylke van der Veen, Gerben Timmerman, Malvina Nissim
As socially unacceptable language become pervasive in social media platforms, the need for automatic content moderation become more pressing.
no code implementations • EMNLP (NLLP) 2021 • Georgios Tziafas, Eugenie de Saint-Phalle, Wietse de Vries, Clara Egger, Tommaso Caselli
The COVID-19 pandemic has witnessed the implementations of exceptional measures by governments across the world to counteract its impact.
1 code implementation • 15 Jun 2023 • Marco Antonio Stranisci, Rossana Damiano, Enrico Mensa, Viviana Patti, Daniele Radicioni, Tommaso Caselli
The corpus, which includes 20 Wikipedia biographies, was compared with five existing corpora to train a model for the biographical event detection task.
no code implementations • 22 Nov 2022 • Fiona Anting Tan, Hansi Hettiarachchi, Ali Hürriyetoğlu, Tommaso Caselli, Onur Uca, Farhana Ferdousi Liza, Nelleke Oostdijk
The best F1 scores achieved for Subtask 1 and 2 were 86. 19% and 54. 15%, respectively.
1 code implementation • 24 Sep 2022 • Gosse Minnema, Sara Gemelli, Chiara Zanchi, Tommaso Caselli, Malvina Nissim
We then train regression models that predict the salience of GBV participants with respect to different dimensions of perceived responsibility.
1 code implementation • LREC 2022 • Fiona Anting Tan, Ali Hürriyetoğlu, Tommaso Caselli, Nelleke Oostdijk, Tadashi Nomoto, Hansi Hettiarachchi, Iqra Ameer, Onur Uca, Farhana Ferdousi Liza, Tiancheng Hu
Leveraging each of these external datasets for training, we achieved up to approximately 64% F1 on the CNC test set without additional fine-tuning.
no code implementations • ACL 2022 • Gosse Minnema, Sara Gemelli, Chiara Zanchi, Tommaso Caselli, Malvina Nissim
SOCIOFILLMORE is a multilingual tool which helps to bring to the fore the focus or the perspective that a text expresses in depicting an event.
1 code implementation • NAACL (NLP4IF) 2021 • Giorgos Tziafas, Konstantinos Kogkalidis, Tommaso Caselli
This paper describes the TOKOFOU system, an ensemble model for misinformation detection tasks based on six different transformer-based pre-trained encoders, implemented in the context of the COVID-19 Infodemic Shared Task for English.
no code implementations • SEMEVAL 2020 • Davide Colla, Tommaso Caselli, Valerio Basile, Jelena Mitrovi{\'c}, Michael Granitzer
We introduce an approach to multilingual Offensive Language Detection based on the mBERT transformer model.
1 code implementation • ACL (WOAH) 2021 • Tommaso Caselli, Valerio Basile, Jelena Mitrović, Michael Granitzer
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in English.
Ranked #1 on
Hate Speech Detection
on AbusEval
no code implementations • LREC 2020 • Juliet van Rosendaal, Tommaso Caselli, Malvina Nissim
Strategies used until now to increase density of abusive language and obtain more meaningful data overall, include data filtering on the basis of pre-selected keywords and hate-rich sources of data.
no code implementations • LREC 2020 • Tommaso Caselli, Valerio Basile, Jelena Mitrovi{\'c}, Inga Kartoziya, Michael Granitzer
However, there is a lack of data sets that take into account the degree of explicitness.
no code implementations • LREC 2020 • Rob van der Goot, Alan Ramponi, Tommaso Caselli, Michele Cafagna, Lorenzo De Mattei
However, for Italian, there is no benchmark available for lexical normalization, despite the presence of many benchmarks for other tasks involving social media data.
2 code implementations • Findings (EMNLP) 2021 • Firoj Alam, Shaden Shaar, Fahim Dalvi, Hassan Sajjad, Alex Nikolov, Hamdy Mubarak, Giovanni Da San Martino, Ahmed Abdelali, Nadir Durrani, Kareem Darwish, Abdulaziz Al-Homaid, Wajdi Zaghouani, Tommaso Caselli, Gijs Danoe, Friso Stolk, Britt Bruntink, Preslav Nakov
With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic.
2 code implementations • 19 Dec 2019 • Wietse de Vries, Andreas van Cranenburgh, Arianna Bisazza, Tommaso Caselli, Gertjan van Noord, Malvina Nissim
The transformer-based pre-trained language model BERT has helped to improve state-of-the-art performance on many natural language processing (NLP) tasks.
Ranked #3 on
Sentiment Analysis
on DBRD
1 code implementation • 4 Oct 2018 • Tommaso Caselli
This paper reports on a set of experiments with different word embeddings to initialize a state-of-the-art Bi-LSTM-CRF network for event detection and classification in Italian, following the EVENTI evaluation exercise.
1 code implementation • COLING 2018 • Tommaso Caselli, Oana Inel
This paper describes a crowdsourcing experiment on the annotation of plot-like structures in English news articles.
no code implementations • WS 2017 • Roxane Segers, Tommaso Caselli, Piek Vossen
In this paper we describe the ongoing work on the Circumstantial Event Ontology (CEO), a newly developed ontology for calamity events that models semantic circumstantial relations between event classes.
no code implementations • WS 2017 • Tommaso Caselli, Piek Vossen
This paper reports on the Event StoryLine Corpus (ESC) v1. 0, a new benchmark dataset for the temporal and causal relation detection.
no code implementations • EACL 2017 • Rachele Sprugnoli, Tommaso Caselli, Sara Tonelli, Giovanni Moretti
This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles.
no code implementations • LREC 2016 • Tommaso Caselli, Rachele Sprugnoli, Oana Inel
This paper describes two sets of crowdsourcing experiments on temporal information annotation conducted on two languages, i. e., English and Italian.
no code implementations • LREC 2016 • Oana Inel, Tommaso Caselli, Lora Aroyo
On the other hand, machines need to understand the information that is published in online data streams and generate concise and meaningful overviews.
no code implementations • LREC 2016 • Tommaso Caselli, Giovanni Moretti, Rachele Sprugnoli, Sara Tonelli, Damien Lanfrey, Donatella Solda Kutzmann
In this paper we present PIERINO (PIattaforma per l{'}Estrazione e il Recupero di INformazione Online), a system that was implemented in collaboration with the Italian Ministry of Education, University and Research to analyse the citizens{'} comments given in {\#}labuonascuola survey.
no code implementations • LREC 2016 • Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo, Piek Vossen
In the last decade, different aspects of linguistic encoding of perspectives have been targeted as separated phenomena through different annotation initiatives.
no code implementations • LREC 2014 • Tommaso Caselli, Laure Vieu, Carlo Strapparava, Guido Vetere
This paper reports on research activities on automatic methods for the enrichment of the Senso Comune platform.
no code implementations • LREC 2012 • Tommaso Caselli, Francesco Rubino, Francesca Frontini, Irene Russo, Valeria Quochi
In addition to this, we assigned to the extracted entries of the lexicon a confidence score based on the relative frequency and evaluated the extractor on domain specific data.
no code implementations • LREC 2012 • Tommaso Caselli, Irene Russo, Francesco Rubino
Sentiment Analysis (SA) and Opinion Mining (OM) have become a popular task in recent years in NLP with the development of language resources, corpora and annotation schemes.
no code implementations • Proceedings of the 5th International Workshop on Semantic Evaluation 2010 • Marc Verhagen, Roser Saurí, Tommaso Caselli, James Pustejovsky
Tempeval-2 comprises evaluation tasks for time expressions, events and temporal relations, the latter of which was split up in four sub tasks, motivated by the notion that smaller subtasks would make both data preparation and temporal relation extraction easier.