Search Results for author: Massimo Poesio

Found 68 papers, 13 papers with code

Named Entity Recognition as Dependency Parsing

1 code implementation • ACL 2020 • Juntao Yu, Bernd Bohnet, Massimo Poesio

Named Entity Recognition (NER) is a fundamental task in Natural Language Processing, concerned with identifying spans of text expressing references to entities.

Ranked #2 on Named Entity Recognition (NER) on GENIA

Dependency Parsing named-entity-recognition +4

340

Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary Detection

1 code implementation • ACL 2019 • Nafise Sadat Moosavi, Leo Born, Massimo Poesio, Michael Strube

To address this problem, minimum spans are manually annotated in smaller corpora.

Boundary Detection coreference-resolution +1

Scoring Coreference Chains with Split-Antecedent Anaphors

1 code implementation • 24 May 2022 • Silviu Paun, Juntao Yu, Nafise Sadat Moosavi, Massimo Poesio

Anaphoric reference is an aspect of language interpretation covering a variety of types of interpretation beyond the simple case of identity reference to entities introduced via nominal expressions covered by the traditional coreference task in its most recent incarnation in ONTONOTES and similar datasets.

Neural Coreference Resolution for Arabic

1 code implementation • COLING (CRAC) 2020 • Abdulrahman Aloraini, Juntao Yu, Massimo Poesio

No neural coreference resolver for Arabic exists; in fact, we are not aware of any learning-based coreference resolver for Arabic since (Björkelund and Kuhn, 2014).

coreference-resolution

A Cluster Ranking Model for Full Anaphora Resolution

1 code implementation • LREC 2020 • Juntao Yu, Alexandra Uma, Massimo Poesio

In this paper, we introduce an architecture to simultaneously identify non-referring expressions (including expletives, predicative NPs, and other types) and build coreference chains, including singletons.

Ranked #1 on Coreference Resolution on The ARRAU Corpus

Coreference Resolution

Stay Together: A System for Single and Split-antecedent Anaphora Resolution

1 code implementation • NAACL 2021 • Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Massimo Poesio

Split-antecedent anaphora is rarer and more complex to resolve than single-antecedent anaphora; as a result, it is not annotated in many datasets designed to test coreference, and previous work on resolving this type of anaphora was carried out in unrealistic conditions that assume gold mentions and/or gold split-antecedent anaphors are available.

Neural Mention Detection

1 code implementation • LREC 2020 • Juntao Yu, Bernd Bohnet, Massimo Poesio

We then evaluate our models for coreference resolution by using mentions predicted by our best model in state-of-the-art coreference systems.

coreference-resolution NER

Multi-task Learning Based Neural Bridging Reference Resolution

1 code implementation • 7 Mar 2020 • Juntao Yu, Massimo Poesio

can be achieved on full bridging resolution with this architecture.

coreference-resolution Multi-Task Learning

Multitask Learning-Based Neural Bridging Reference Resolution

1 code implementation • COLING 2020 • Juntao Yu, Massimo Poesio

can be achieved on full bridging resolution with this architecture.

coreference-resolution Multi-Task Learning

Extending Activation Steering to Broad Skills and Multiple Behaviours

1 code implementation • 9 Mar 2024 • Teun van der Weij, Massimo Poesio, Nandi Schoots

In this paper, we investigate the efficacy of activation steering for broad skills and multiple behaviours.

Measuring Conversational Fluidity in Automated Dialogue Agents

1 code implementation • 25 Oct 2019 • Keith Vella, Massimo Poesio, Michael Sigamani, Cihan Dogan, Aimore Dutra, Dimitrios Dimakopoulos, Alfredo Gemma, Ella Walters

We present an automated evaluation method to measure fluidity in conversational dialogue systems.

PR2: A Language Independent Unsupervised Tool for Personality Recognition from Text

no code implementations • 12 Feb 2014 • Fabio Celli, Massimo Poesio

We present PR2, a personality recognition system available online, that performs instance-based classification of Big5 personality types from unstructured text, using language-independent features.

General Classification

A Probabilistic Annotation Model for Crowdsourcing Coreference

no code implementations • EMNLP 2018 • Silviu Paun, Jon Chamberlain, Udo Kruschwitz, Juntao Yu, Massimo Poesio

The availability of large scale annotated corpora for coreference is essential to the development of the field.

Coreference Resolution Question Answering

Visually Grounded and Textual Semantic Models Differentially Decode Brain Activity Associated with Concrete and Abstract Nouns

no code implementations • TACL 2017 • Andrew J. Anderson, Douwe Kiela, Stephen Clark, Massimo Poesio

Dual coding theory considers concrete concepts to be encoded in the brain both linguistically and visually, and abstract concepts only linguistically.

Anaphora Resolution with the ARRAU Corpus

no code implementations • WS 2018 • Massimo Poesio, Yulia Grishina, Varada Kolhatkar, Nafise Moosavi, Ina Roesiger, Adam Roussel, Fabian Simonjetz, Alexandra Uma, Olga Uryupina, Juntao Yu, Heike Zinsmeister

The most distinctive feature of the corpus is the annotation of a wide range of anaphoric relations, including bridging references and discourse deixis in addition to identity (coreference).

Incongruent Headlines: Yet Another Way to Mislead Your Readers

no code implementations • WS 2017 • Sophie Chesney, Maria Liakata, Massimo Poesio, Matthew Purver

This paper discusses the problem of incongruent headlines: those which do not accurately represent the information contained in the article with which they occur.

Coreference Resolution for the Basque Language with BART

no code implementations • WS 2016 • Ander Soraluze, Olatz Arregi, Xabier Arregi, Arantza Díaz de Ilarraza, Mijail Kabadjov, Massimo Poesio

Chunking coreference-resolution +2

Predicting Brexit: Classifying Agreement is Better than Sentiment and Pollsters

no code implementations • WS 2016 • Fabio Celli, Evgeny Stepanov, Massimo Poesio, Giuseppe Riccardi

On June 23rd 2016, the UK held the referendum which ratified the exit from the EU.

General Classification Opinion Mining +1

Combining Minimally-supervised Methods for Arabic Named Entity Recognition

no code implementations • TACL 2015 • Maha Althobaiti, Udo Kruschwitz, Massimo Poesio

Supervised methods can achieve high performance on NLP tasks, such as Named Entity Recognition (NER), but new annotations are required for every new domain and/or genre change.

named-entity-recognition Named Entity Recognition +2

Identifying fake Amazon reviews as learning from crowds

no code implementations • EACL 2014 • Tommaso Fornaciari, Massimo Poesio

Automatic Creation of Arabic Named Entity Annotated Corpus Using Wikipedia

no code implementations • EACL 2014 • Maha Althobaiti, Udo Kruschwitz, Massimo Poesio

Morphological Analysis Named Entity Recognition (NER)

Of Words, Eyes and Brains: Correlating Image-Based Distributional Semantic Models with Neural Representations of Concepts

no code implementations • EMNLP 2013 • Andrew J. Anderson, Elia Bruni, Ulisse Bordignon, Massimo Poesio, Marco Baroni

BART goes multilingual: The UniTN / Essex submission to the CoNLL-2012 Shared Task

no code implementations • WS 2012 • Olga Uryupina, Alessandro Moschitti, Massimo Poesio

Coreference Resolution

MultiLing 2015: Multilingual Summarization of Single and Multi-Documents, On-line Fora, and Call-center Conversations

no code implementations • WS 2015 • George Giannakopoulos, Jeff Kubina, John Conroy, Josef Steinberger, Benoit Favre, Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio

Document Summarization Multi-Document Summarization

On the Use of Homogenous Sets of Subjects in Deceptive Language Analysis

no code implementations • WS 2012 • Tommaso Fornaciari, Massimo Poesio

Deception Detection

Annotating Archaeological Texts: An Example of Domain-Specific Annotation in the Humanities

no code implementations • WS 2012 • Francesca Bonin, Fabio Cavulli, Aronne Noriller, Massimo Poesio, Egon W. Stemle

On discriminating fMRI representations of abstract WordNet taxonomic categories

no code implementations • WS 2012 • Andrew Anderson, Tao Yuan, Brian Murphy, Massimo Poesio

Relational Structures and Models for Coreference Resolution

no code implementations • COLING 2012 • Truc-Vien T. Nguyen, Massimo Poesio

coreference-resolution Relation Extraction

Adapting a State-of-the-art Anaphora Resolution System for Resource-poor Language

no code implementations • IJCNLP 2013 • Utpal Sikdar, Asif Ekbal, Sriparna Saha, Olga Uryupina, Massimo Poesio

Coreference Resolution Text Summarization

AraNLP: a Java-based Library for the Processing of Arabic Text.

no code implementations • LREC 2014 • Maha Althobaiti, Udo Kruschwitz, Massimo Poesio

We present a free, Java-based library named "AraNLP" that covers various Arabic text preprocessing tools.

Information Retrieval POS +2

DeCour: a corpus of DEceptive statements in Italian COURts

no code implementations • LREC 2012 • Tommaso Fornaciari, Massimo Poesio

In criminal proceedings, sometimes it is not easy to evaluate the sincerity of oral testimonies.

Deception Detection

Domain-specific vs. Uniform Modeling for Coreference Resolution

no code implementations • LREC 2012 • Olga Uryupina, Massimo Poesio

Several corpora annotated for coreference have been made available in the past decade.

coreference-resolution Domain Adaptation +1

A Semi-supervised Learning Approach to Arabic Named Entity Recognition

no code implementations • RANLP 2013 • Maha Althobaiti, Udo Kruschwitz, Massimo Poesio

named-entity-recognition Named Entity Recognition +1

A Crowdsourced Corpus of Multiple Judgments and Disagreement on Anaphoric Interpretation

no code implementations • NAACL 2019 • Massimo Poesio, Jon Chamberlain, Silviu Paun, Juntao Yu, Alexandra Uma, Udo Kruschwitz

The corpus, containing annotations for about 108,000 markables, is one of the largest corpora for coreference for English, and one of the largest crowdsourced NLP corpora, but its main feature is the large number of judgments per markable: 20 on average, and over 2.2M in total.

Comparing Bayesian Models of Annotation

no code implementations • TACL 2018 • Silviu Paun, Bob Carpenter, Jon Chamberlain, Dirk Hovy, Udo Kruschwitz, Massimo Poesio

We evaluate these models along four aspects: comparison to gold labels, predictive accuracy for new annotations, annotator characterization, and item difficulty, using four datasets with varying degrees of noise in the form of random (spammy) annotators.

Model Selection

The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015

no code implementations • LREC 2016 • Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama, Hugo Zaragoza

In this paper we present the OnForumS corpus developed for the shared task of the same name on Online Forum Summarisation (OnForumS at MultiLing'15).

Phrase Detectives Corpus 1.0 Crowdsourced Anaphoric Coreference.

no code implementations • LREC 2016 • Jon Chamberlain, Massimo Poesio, Udo Kruschwitz

Corpora are typically annotated by several experts to create a gold standard; however, there are now compelling reasons to use a non-expert crowd to annotate text, driven by cost, speed and scalability.

text annotation

ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions

no code implementations • LREC 2016 • Olga Uryupina, Ron Artstein, Antonella Bristot, Federica Cavicchio, Kepa Rodriguez, Massimo Poesio

This paper presents a second release of the ARRAU dataset: a multi-domain corpus with thorough linguistically motivated annotation of anaphora and related phenomena.

Crowdsourcing and Aggregating Nested Markable Annotations

1 code implementation • ACL 2019 • Chris Madge, Juntao Yu, Jon Chamberlain, Udo Kruschwitz, Silviu Paun, Massimo Poesio

One of the key steps in language resource creation is the identification of the text segments to be annotated, or markables, which depending on the task may vary from nominal chunks for named entity resolution to (potentially nested) noun phrases in coreference resolution (or mentions) to larger text segments in text segmentation.

coreference-resolution Entity Resolution +1

Speaking Outside the Box: Exploring the Benefits of Unconstrained Input in Crowdsourcing and Citizen Science Platforms

no code implementations • LREC 2020 • Jon Chamberlain, Udo Kruschwitz, Massimo Poesio

Crowdsourcing approaches provide a difficult design challenge for developers.

Aggregation Driven Progression System for GWAPs

no code implementations • LREC 2020 • Osman Doruk Kicikoglu, Richard Bartle, Jon Chamberlain, Silviu Paun, Massimo Poesio

As the uses of Games-With-A-Purpose (GWAPs) broaden, the systems that incorporate their usages have expanded in complexity.

Cross-lingual Zero Pronoun Resolution

no code implementations • LREC 2020 • Abdulrahman Aloraini, Massimo Poesio

In languages like Arabic, Chinese, Italian, Japanese, Korean, Portuguese, Spanish, and many others, predicate arguments in certain syntactic positions are not realized instead of being realized as overt pronouns, and are thus called zero- or null-pronouns.

Machine Translation Translation

Free the Plural: Unrestricted Split-Antecedent Anaphora Resolution

1 code implementation • COLING 2020 • Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Massimo Poesio

One limitation of virtually all coreference resolution models is the focus on single-antecedent anaphors.

coreference-resolution Transfer Learning

Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning

no code implementations • NAACL 2021 • Tommaso Fornaciari, Alexandra Uma, Silviu Paun, Barbara Plank, Dirk Hovy, Massimo Poesio

Supervised learning assumes that a ground truth label exists.

Multi-Task Learning

BERTective: Language Models and Contextual Information for Deception Detection

no code implementations • EACL 2021 • Tommaso Fornaciari, Federico Bianchi, Massimo Poesio, Dirk Hovy

In most cases, however, the target texts' preceding context is not considered.

Deception Detection

Assessing Polyseme Sense Similarity through Co-predication Acceptability and Contextualised Embedding Distance

no code implementations • Joint Conference on Lexical and Computational Semantics 2020 • Janosch Haber, Massimo Poesio

Co-predication is one of the most frequently used linguistic tests to tell apart shifts in polysemic sense from changes in homonymic meaning.

Word Embeddings

SemEval-2021 Task 12: Learning with Disagreements

no code implementations • SEMEVAL 2021 • Alexandra Uma, Tommaso Fornaciari, Anca Dumitrache, Tristan Miller, Jon Chamberlain, Barbara Plank, Edwin Simpson, Massimo Poesio

Disagreement between coders is ubiquitous in virtually all datasets annotated with human judgements in both natural language processing and computer vision.

Data Augmentation Methods for Anaphoric Zero Pronouns

no code implementations • CRAC (ACL) 2021 • Abdulrahman Aloraini, Massimo Poesio

In pro-drop languages like Arabic, Chinese, Italian, Japanese, Spanish, and many others, unrealized (null) arguments in certain syntactic positions can refer to a previously introduced entity, and are thus called anaphoric zero pronouns.

Data Augmentation

Coreference Resolution for the Biomedical Domain: A Survey

no code implementations • CRAC (ACL) 2021 • Pengcheng Lu, Massimo Poesio

Issues with coreference resolution are one of the most frequently mentioned challenges for information extraction from the biomedical literature.

coreference-resolution

Patterns of Lexical Ambiguity in Contextualised Language Models

no code implementations • 27 Sep 2021 • Janosch Haber, Massimo Poesio

One of the central aspects of contextualised language models is that they should be able to distinguish the meaning of lexically ambiguous words by their contexts.

The CODI-CRAC 2021 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue

no code implementations • ACL (CODI, CRAC) 2021 • Sopan Khosla, Juntao Yu, Ramesh Manuvinakurike, Vincent Ng, Massimo Poesio, Michael Strube, Carolyn Rosé

In this paper, we provide an overview of the CODI-CRAC 2021 Shared-Task: Anaphora Resolution in Dialogue.

Patterns of Polysemy and Homonymy in Contextualised Language Models

no code implementations • Findings (EMNLP) 2021 • Janosch Haber, Massimo Poesio

One of the central aspects of contextualised language models is that they should be able to distinguish the meaning of lexically ambiguous words by their contexts.

We Need to Consider Disagreement in Evaluation

no code implementations • ACL (BPPF) 2021 • Valerio Basile, Michael Fell, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank, Massimo Poesio, Alexandra Uma

Instead, we suggest that we need to better capture the sources of disagreement to improve today’s evaluation practice.

The QMUL/HRBDT contribution to the NADI Arabic Dialect Identification Shared Task

no code implementations • COLING (WANLP) 2020 • Abdulrahman Aloraini, Massimo Poesio, Ayman Alhelbawy

We present the Arabic dialect identification system that we used for the country-level subtask of the NADI challenge.

Dialect Identification

Polygloss - A conversational agent for language practice

no code implementations • NLP4CALL (COLING) 2020 • Etiene da Cruz Dalcol, Massimo Poesio

Word Sense Distance in Human Similarity Judgements and Contextualised Word Embeddings

no code implementations • PaM 2020 • Janosch Haber, Massimo Poesio

Homonymy is often used to showcase one of the advantages of context-sensitive word embedding techniques such as ELMo and BERT.

Word Embeddings

Anaphoric Zero Pronoun Identification: A Multilingual Approach

no code implementations • COLING (CRAC) 2020 • Abdulrahman Aloraini, Massimo Poesio

We propose a BERT-based multilingual model for AZP identification from predicted zero pronoun positions, and evaluate it on the Arabic and Chinese portions of OntoNotes 5.0.

Transfer Learning

A Mention-Pair Model of Annotation with Nonparametric User Communities

no code implementations • 25 Sep 2019 • Silviu Paun, Juntao Yu, Jon Chamberlain, Udo Kruschwitz, Massimo Poesio

The model is also flexible enough to be used in standard annotation tasks for classification where it registers on par performance with the state of the art.

Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa

no code implementations • ACL 2022 • Tommaso Fornaciari, Alexandra Uma, Massimo Poesio, Dirk Hovy

Natural Language Processing (NLP)'s applied nature makes it necessary to select the most effective and robust models.

Experimental Design

ArMIS - The Arabic Misogyny and Sexism Corpus with Annotator Subjective Disagreements

no code implementations • LREC 2022 • Dina Almanea, Massimo Poesio

The use of misogynistic and sexist language has increased in recent years in social media, and is increasing in the Arabic world in reaction to reforms attempting to remove restrictions on women's lives.

The Universal Anaphora Scorer

no code implementations • LREC 2022 • Juntao Yu, Sopan Khosla, Nafise Sadat Moosavi, Silviu Paun, Sameer Pradhan, Massimo Poesio

It also supports the evaluation of split antecedent anaphora and discourse deixis, for which no tools existed.

Less Text, More Visuals: Evaluating the Onboarding Phase in a GWAP for NLP

no code implementations • games (LREC) 2022 • Fatima Althani, Chris Madge, Massimo Poesio

Games-with-a-purpose find attracting players a challenge.

Aggregating Crowdsourced and Automatic Judgments to Scale Up a Corpus of Anaphoric Reference for Fiction and Wikipedia Texts

no code implementations • 11 Oct 2022 • Juntao Yu, Silviu Paun, Maris Camilleri, Paloma Carretero Garcia, Jon Chamberlain, Udo Kruschwitz, Massimo Poesio

Although several datasets annotated for anaphoric reference/coreference exist, even the largest such datasets have limitations in terms of size, range of domains, coverage of anaphoric phenomena, and size of documents included.

The CODI-CRAC 2022 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue

no code implementations • COLING (CODI, CRAC) 2022 • Juntao Yu, Sopan Khosla, Ramesh Manuvinakurike, Lori Levin, Vincent Ng, Massimo Poesio, Michael Strube, Carolyn Rosé

The CODI-CRAC 2022 Shared Task on Anaphora Resolution in Dialogues is the second edition of an initiative focused on detecting different types of anaphoric relations in conversations of different kinds.

Joint Coreference Resolution for Zeros and non-Zeros in Arabic

no code implementations • 21 Oct 2022 • Abdulrahman Aloraini, Sameer Pradhan, Massimo Poesio

Most existing proposals about anaphoric zero pronoun (AZP) resolution regard full mention coreference and AZP resolution as two independent tasks, even though the two tasks are clearly related.

coreference-resolution

SemEval-2023 Task 11: Learning With Disagreements (LeWiDi)

no code implementations • 28 Apr 2023 • Elisa Leonardelli, Alexandra Uma, Gavin Abercrombie, Dina Almanea, Valerio Basile, Tommaso Fornaciari, Barbara Plank, Verena Rieser, Massimo Poesio

We report on the second LeWiDi shared task, which differs from the first edition in three crucial respects: (i) it focuses entirely on NLP, instead of both NLP and computer vision tasks in its first edition; (ii) it focuses on subjective tasks, instead of covering different types of disagreements, as training with aggregated labels for subjective NLP tasks is a particularly obvious misrepresentation of the data; and (iii) for the evaluation, we concentrate on soft approaches to evaluation.

Sentiment Analysis

Large Language Models as Minecraft Agents

no code implementations • 13 Feb 2024 • Chris Madge, Massimo Poesio

In this work we examine the use of Large Language Models (LLMs) in the challenging setting of acting as a Minecraft agent.

Integrating knowledge bases to improve coreference and bridging resolution for the chemical domain

no code implementations • 16 Apr 2024 • Pengcheng Lu, Massimo Poesio

Resolving coreference and bridging relations in chemical patents is important for better understanding the precise chemical process, where chemical domain knowledge is very critical.

Chemical Process Multi-Task Learning
