no code implementations • ACL (CODI, CRAC) 2021 • Sopan Khosla, Juntao Yu, Ramesh Manuvinakurike, Vincent Ng, Massimo Poesio, Michael Strube, Carolyn Rosé
In this paper, we provide an overview of the CODI-CRAC 2021 Shared-Task: Anaphora Resolution in Dialogue.
no code implementations • games (LREC) 2022 • Fatima Althani, Chris Madge, Massimo Poesio
Games-with-a-purpose find attracting players a challenge.
no code implementations • ACL (BPPF) 2021 • Valerio Basile, Michael Fell, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank, Massimo Poesio, Alexandra Uma
Instead, we suggest that we need to better capture the sources of disagreement to improve today’s evaluation practice.
no code implementations • Findings (EMNLP) 2021 • Janosch Haber, Massimo Poesio
One of the central aspects of contextualised language models is that they should be able to distinguish the meaning of lexically ambiguous words by their contexts.
no code implementations • COLING (CRAC) 2020 • Abdulrahman Aloraini, Massimo Poesio
We propose a BERT-based multilingual model for AZP identification from predicted zero pronoun positions, and evaluate it on the Arabic and Chinese portions of OntoNotes 5.0.
no code implementations • PaM 2020 • Janosch Haber, Massimo Poesio
Homonymy is often used to showcase one of the advantages of context-sensitive word embedding techniques such as ELMo and BERT.
no code implementations • LREC 2022 • Juntao Yu, Sopan Khosla, Nafise Sadat Moosavi, Silviu Paun, Sameer Pradhan, Massimo Poesio
It also supports the evaluation of split antecedent anaphora and discourse deixis, for which no tools existed.
no code implementations • LREC 2022 • Dina Almanea, Massimo Poesio
The use of misogynistic and sexist language has increased in recent years in social media, and is increasing in the Arabic world in reaction to reforms attempting to remove restrictions on women's lives.
no code implementations • ACL 2022 • Tommaso Fornaciari, Alexandra Uma, Massimo Poesio, Dirk Hovy
Natural Language Processing (NLP)'s applied nature makes it necessary to select the most effective and robust models.
no code implementations • COLING (WANLP) 2020 • Abdulrahman Aloraini, Massimo Poesio, Ayman Alhelbawy
We present the Arabic dialect identification system that we used for the country-level subtask of the NADI challenge.
no code implementations • COLING (CODI, CRAC) 2022 • Juntao Yu, Sopan Khosla, Ramesh Manuvinakurike, Lori Levin, Vincent Ng, Massimo Poesio, Michael Strube, Carolyn Rosé
The CODI-CRAC 2022 Shared Task on Anaphora Resolution in Dialogues is the second edition of an initiative focused on detecting different types of anaphoric relations in conversations of different kinds.
no code implementations • 15 Nov 2024 • Maja Pavlovic, Massimo Poesio
With the increasing capabilities of LLMs, recent studies focus on understanding whose opinions are represented by them and how to effectively extract aligned opinion distributions.
1 code implementation • 9 Sep 2024 • Yujian Gan, Changling Li, Jinxia Xie, Luou Wen, Matthew Purver, Massimo Poesio
The benchmark includes 31 different task types, each with 10 unique dialogue scenarios between information seeker and provider agents.
no code implementations • 17 Jul 2024 • Chris Madge, Massimo Poesio
In this work we propose adapting the Minecraft builder task into an LLM benchmark suitable for evaluating LLM ability in spatially orientated tasks, and informing builder agent design.
no code implementations • 2 May 2024 • Maja Pavlovic, Massimo Poesio
Large Language Models (LLMs) have emerged as powerful support tools across various natural language tasks and a range of application domains.
no code implementations • 16 Apr 2024 • Pengcheng Lu, Massimo Poesio
Resolving coreference and bridging relations in chemical patents is important for better understanding the precise chemical process, where chemical domain knowledge is very critical.
1 code implementation • 9 Mar 2024 • Teun van der Weij, Massimo Poesio, Nandi Schoots
In this paper, we investigate the efficacy of activation steering for broad skills and multiple behaviours.
no code implementations • 13 Feb 2024 • Chris Madge, Massimo Poesio
In this work we examine the use of Large Language Models (LLMs) in the challenging setting of acting as a Minecraft agent.
no code implementations • 28 Apr 2023 • Elisa Leonardelli, Alexandra Uma, Gavin Abercrombie, Dina Almanea, Valerio Basile, Tommaso Fornaciari, Barbara Plank, Verena Rieser, Massimo Poesio
We report on the second LeWiDi shared task, which differs from the first edition in three crucial respects: (i) it focuses entirely on NLP, instead of both NLP and computer vision tasks in its first edition; (ii) it focuses on subjective tasks, instead of covering different types of disagreements, as training with aggregated labels for subjective NLP tasks is a particularly obvious misrepresentation of the data; and (iii) for the evaluation, we concentrate on soft approaches to evaluation.
no code implementations • 21 Oct 2022 • Abdulrahman Aloraini, Sameer Pradhan, Massimo Poesio
Most existing proposals about anaphoric zero pronoun (AZP) resolution regard full mention coreference and AZP resolution as two independent tasks, even though the two tasks are clearly related.
no code implementations • 11 Oct 2022 • Juntao Yu, Silviu Paun, Maris Camilleri, Paloma Carretero Garcia, Jon Chamberlain, Udo Kruschwitz, Massimo Poesio
Although several datasets annotated for anaphoric reference/coreference exist, even the largest such datasets have limitations in terms of size, range of domains, coverage of anaphoric phenomena, and size of documents included.
1 code implementation • 24 May 2022 • Silviu Paun, Juntao Yu, Nafise Sadat Moosavi, Massimo Poesio
Anaphoric reference is an aspect of language interpretation covering a variety of types of interpretation beyond the simple case of identity reference to entities introduced via nominal expressions covered by the traditional coreference task in its most recent incarnation in ONTONOTES and similar datasets.
no code implementations • 27 Sep 2021 • Janosch Haber, Massimo Poesio
One of the central aspects of contextualised language models is that they should be able to distinguish the meaning of lexically ambiguous words by their contexts.
no code implementations • CRAC (ACL) 2021 • Pengcheng Lu, Massimo Poesio
Issues with coreference resolution are one of the most frequently mentioned challenges for information extraction from the biomedical literature.
no code implementations • CRAC (ACL) 2021 • Abdulrahman Aloraini, Massimo Poesio
In pro-drop languages like Arabic, Chinese, Italian, Japanese, Spanish, and many others, unrealized (null) arguments in certain syntactic positions can refer to a previously introduced entity, and are thus called anaphoric zero pronouns.
no code implementations • SEMEVAL 2021 • Alexandra Uma, Tommaso Fornaciari, Anca Dumitrache, Tristan Miller, Jon Chamberlain, Barbara Plank, Edwin Simpson, Massimo Poesio
Disagreement between coders is ubiquitous in virtually all datasets annotated with human judgements in both natural language processing and computer vision.
no code implementations • NAACL 2021 • Tommaso Fornaciari, Alexandra Uma, Silviu Paun, Barbara Plank, Dirk Hovy, Massimo Poesio
Supervised learning assumes that a ground truth label exists.
1 code implementation • NAACL 2021 • Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Massimo Poesio
Split-antecedent anaphora is rarer and more complex to resolve than single-antecedent anaphora; as a result, it is not annotated in many datasets designed to test coreference, and previous work on resolving this type of anaphora was carried out in unrealistic conditions that assume gold mentions and/or gold split-antecedent anaphors are available.
no code implementations • EACL 2021 • Tommaso Fornaciari, Federico Bianchi, Massimo Poesio, Dirk Hovy
In most cases, however, the target texts' preceding context is not considered.
no code implementations • Joint Conference on Lexical and Computational Semantics 2020 • Janosch Haber, Massimo Poesio
Co-predication is one of the most frequently used linguistic tests to tell apart shifts in polysemic sense from changes in homonymic meaning.
1 code implementation • COLING 2020 • Juntao Yu, Massimo Poesio
can be achieved on full bridging resolution with this architecture.
1 code implementation • COLING (CRAC) 2020 • Abdulrahman Aloraini, Juntao Yu, Massimo Poesio
No neural coreference resolver for Arabic exists; in fact, we are not aware of any learning-based coreference resolver for Arabic since (Bjorkelund and Kuhn, 2014).
1 code implementation • COLING 2020 • Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Massimo Poesio
One limitation of virtually all coreference resolution models is the focus on single-antecedent anaphors.
1 code implementation • ACL 2020 • Juntao Yu, Bernd Bohnet, Massimo Poesio
Named Entity Recognition (NER) is a fundamental task in Natural Language Processing, concerned with identifying spans of text expressing references to entities.
Ranked #2 on Named Entity Recognition (NER) on GENIA
no code implementations • LREC 2020 • Jon Chamberlain, Udo Kruschwitz, Massimo Poesio
Crowdsourcing approaches provide a difficult design challenge for developers.
no code implementations • LREC 2020 • Osman Doruk Kicikoglu, Richard Bartle, Jon Chamberlain, Silviu Paun, Massimo Poesio
As the use of Games-With-A-Purpose (GWAPs) broadens, the systems that incorporate them have expanded in complexity.
no code implementations • LREC 2020 • Abdulrahman Aloraini, Massimo Poesio
In languages like Arabic, Chinese, Italian, Japanese, Korean, Portuguese, Spanish, and many others, predicate arguments in certain syntactic positions are not realized instead of being realized as overt pronouns, and are thus called zero- or null-pronouns.
1 code implementation • 7 Mar 2020 • Juntao Yu, Massimo Poesio
can be achieved on full bridging resolution with this architecture.
1 code implementation • LREC 2020 • Juntao Yu, Alexandra Uma, Massimo Poesio
In this paper, we introduce an architecture to simultaneously identify non-referring expressions (including expletives, predicative NPs, and other types) and build coreference chains, including singletons.
Ranked #1 on Coreference Resolution on The ARRAU Corpus
1 code implementation • 25 Oct 2019 • Keith Vella, Massimo Poesio, Michael Sigamani, Cihan Dogan, Aimore Dutra, Dimitrios Dimakopoulos, Alfredo Gemma, Ella Walters
We present an automated evaluation method to measure fluidity in conversational dialogue systems.
no code implementations • 25 Sep 2019 • Silviu Paun, Juntao Yu, Jon Chamberlain, Udo Kruschwitz, Massimo Poesio
The model is also flexible enough to be used in standard annotation tasks for classification where it registers on par performance with the state of the art.
1 code implementation • LREC 2020 • Juntao Yu, Bernd Bohnet, Massimo Poesio
We then evaluate our models for coreference resolution by using mentions predicted by our best model in state-of-the-art coreference systems.
1 code implementation • ACL 2019 • Chris Madge, Juntao Yu, Jon Chamberlain, Udo Kruschwitz, Silviu Paun, Massimo Poesio
One of the key steps in language resource creation is the identification of the text segments to be annotated, or markables, which depending on the task may vary from nominal chunks for named entity resolution to (potentially nested) noun phrases in coreference resolution (or mentions) to larger text segments in text segmentation.
1 code implementation • ACL 2019 • Nafise Sadat Moosavi, Leo Born, Massimo Poesio, Michael Strube
To address this problem, minimum spans are manually annotated in smaller corpora.
no code implementations • NAACL 2019 • Massimo Poesio, Jon Chamberlain, Silviu Paun, Juntao Yu, Alexandra Uma, Udo Kruschwitz
The corpus, containing annotations for about 108,000 markables, is one of the largest corpora for coreference for English, and one of the largest crowdsourced NLP corpora, but its main feature is the large number of judgments per markable: 20 on average, and over 2.2M in total.
no code implementations • EMNLP 2018 • Silviu Paun, Jon Chamberlain, Udo Kruschwitz, Juntao Yu, Massimo Poesio
The availability of large scale annotated corpora for coreference is essential to the development of the field.
no code implementations • WS 2018 • Massimo Poesio, Yulia Grishina, Varada Kolhatkar, Nafise Moosavi, Ina Roesiger, Adam Roussel, Fabian Simonjetz, Alexandra Uma, Olga Uryupina, Juntao Yu, Heike Zinsmeister
The most distinctive feature of the corpus is the annotation of a wide range of anaphoric relations, including bridging references and discourse deixis in addition to identity (coreference).
no code implementations • TACL 2018 • Silviu Paun, Bob Carpenter, Jon Chamberlain, Dirk Hovy, Udo Kruschwitz, Massimo Poesio
We evaluate these models along four aspects: comparison to gold labels, predictive accuracy for new annotations, annotator characterization, and item difficulty, using four datasets with varying degrees of noise in the form of random (spammy) annotators.
no code implementations • WS 2017 • Sophie Chesney, Maria Liakata, Massimo Poesio, Matthew Purver
This paper discusses the problem of incongruent headlines: those which do not accurately represent the information contained in the article with which they occur.
no code implementations • TACL 2017 • Andrew J. Anderson, Douwe Kiela, Stephen Clark, Massimo Poesio
Dual coding theory considers concrete concepts to be encoded in the brain both linguistically and visually, and abstract concepts only linguistically.
no code implementations • WS 2016 • Fabio Celli, Evgeny Stepanov, Massimo Poesio, Giuseppe Riccardi
On June 23rd 2016, the UK held the referendum that ratified its exit from the EU.
no code implementations • LREC 2016 • Jon Chamberlain, Massimo Poesio, Udo Kruschwitz
Corpora are typically annotated by several experts to create a gold standard; however, there are now compelling reasons to use a non-expert crowd to annotate text, driven by cost, speed and scalability.
no code implementations • LREC 2016 • Olga Uryupina, Ron Artstein, Antonella Bristot, Federica Cavicchio, Kepa Rodriguez, Massimo Poesio
This paper presents a second release of the ARRAU dataset: a multi-domain corpus with thorough linguistically motivated annotation of anaphora and related phenomena.
no code implementations • LREC 2016 • Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama, Hugo Zaragoza
In this paper we present the OnForumS corpus developed for the shared task of the same name on Online Forum Summarisation (OnForumS at MultiLing'15).
no code implementations • TACL 2015 • Maha Althobaiti, Udo Kruschwitz, Massimo Poesio
Supervised methods can achieve high performance on NLP tasks, such as Named Entity Recognition (NER), but new annotations are required for every new domain and/or genre change.
no code implementations • LREC 2014 • Maha Althobaiti, Udo Kruschwitz, Massimo Poesio
We present a free, Java-based library named "AraNLP" that covers various Arabic text preprocessing tools.
no code implementations • 12 Feb 2014 • Fabio Celli, Massimo Poesio
We present PR2, a personality recognition system available online, that performs instance-based classification of Big5 personality types from unstructured text, using language-independent features.
no code implementations • LREC 2012 • Tommaso Fornaciari, Massimo Poesio
In criminal proceedings, sometimes it is not easy to evaluate the sincerity of oral testimonies.
no code implementations • LREC 2012 • Olga Uryupina, Massimo Poesio
Several corpora annotated for coreference have been made available in the past decade.