Search Results for author: Piek Vossen

Found 79 papers, 20 papers with code

Annotating Targets of Toxic Language at the Span Level

1 code implementation TRAC (COLING) 2022 Baran Barbarestani, Isa Maks, Piek Vossen

We therefore created a crowd-annotation task to mark the spans of words that refer to target communities as an extension of the HateXplain data set.

Efficiently and Thoroughly Anonymizing a Transformer Language Model for Dutch Electronic Health Records: a Two-Step Method

no code implementations LREC 2022 Stella Verkijk, Piek Vossen

Adopting the fill-mask task where the model predicts what tokens are most probable in a certain context, it was tested how often the model will predict a name in a context where a name should be.

Fill Mask Language Modelling

Introducing Frege to Fillmore: A FrameNet Dataset that Captures both Sense and Reference

no code implementations LREC 2022 Levi Remijnse, Piek Vossen, Antske Fokkens, Sam Titarsolej

This article presents the first output of the Dutch FrameNet annotation tool, which facilitates both referential- and frame annotations of language-independent corpora.

Sentence

The Role of Common Ground for Referential Expressions in Social Dialogues

1 code implementation COLING (CRAC) 2022 Jaap Kruijt, Piek Vossen

In this paper, we frame the problem of co-reference resolution in dialogue as a dynamic social process in which mentions to people previously known and newly introduced are mixed when people know each other well.

ReferenceNet: a semantic-pragmatic network for capturing reference relations.

no code implementations GWC 2018 Piek Vossen, Filip Ilievski, Marten Postrma

In this paper, we present ReferenceNet: a semantic-pragmatic network of reference relations between synsets.

Word Embeddings

Variation in framing as a function of temporal reporting distance

1 code implementation IWCS (ACL) 2021 Levi Remijnse, Marten Postma, Piek Vossen

In this paper, we measure variation in framing as a function of foregrounding and backgrounding in a co-referential corpus with a range of temporal distance.

Batavia asked for advice. Pretrained language models for Named Entity Recognition in historical texts.

1 code implementation EMNLP (LaTeCHCLfL, CLFL, LaTeCH) 2021 Sophie I. Arnoult, Lodewijk Petram, Piek Vossen

We consider monolingual and multilingual models from the perspective of historical texts, and in particular for texts enriched with editorial notes: how do language models deal with the historical and editorial content in these texts?

named-entity-recognition Named Entity Recognition +1

Toward a truly multilingual GlobalWordnet Grid

no code implementations GWC 2016 Piek Vossen, Francis Bond, John McCrae

In this paper, we describe a new and improved Global Wordnet Grid that takes advantage of the Collaborative InterLingual Index (CILI).

The Predicate Matrix and the Event and Implied Situation Ontology: Making More of Events

no code implementations GWC 2016 Roxane Segers, Egoitz Laparra, Marco Rospocher, Piek Vossen, German Rigau, Filip Ilievski

This paper presents the Event and Implied Situation Ontology (ESO), a resource which formalizes the pre and post situations of events and the roles of the entities affected by an event.

Open Dutch WordNet

1 code implementation GWC 2016 Marten Postma, Emiel van Miltenburg, Roxane Segers, Anneleen Schoen, Piek Vossen

We describe Open Dutch WordNet, which has been derived from the Cornetto database, the Princeton WordNet and open source resources.

CILI: the Collaborative Interlingual Index

no code implementations GWC 2016 Francis Bond, Piek Vossen, John McCrae, Christiane Fellbaum

This paper introduces the motivation for and design of the Collaborative InterLingual Index (CILI).

When to explain: Identifying explanation triggers in human-agent interaction

no code implementations ACL (NL4XAI, INLG) 2020 Lea Krause, Piek Vossen

With more agents deployed than ever, users need to be able to interact and cooperate with them in an effective and comfortable manner.

A Hybrid Intelligence Method for Argument Mining

no code implementations11 Mar 2024 Michiel van der Meer, Enrico Liscio, Catholijn M. Jonker, Aske Plaat, Piek Vossen, Pradeep K. Murukannaiah

We find that, on the one hand, HyEnA achieves higher coverage and precision than a state-of-the-art automated method when compared to a common set of diverse opinions, justifying the need for human insight.

Argument Mining

An Empirical Analysis of Diversity in Argument Summarization

no code implementations2 Feb 2024 Michiel van der Meer, Piek Vossen, Catholijn M. Jonker, Pradeep K. Murukannaiah

Presenting high-level arguments is a crucial task for fostering participation in online societal discussions.

Do Differences in Values Influence Disagreements in Online Discussions?

1 code implementation24 Oct 2023 Michiel van der Meer, Piek Vossen, Catholijn M. Jonker, Pradeep K. Murukannaiah

We investigate a hypothesis that differences in personal values are indicative of disagreement in online discussions.

Reasoning about Ambiguous Definite Descriptions

1 code implementation23 Oct 2023 Stefan F. Schouten, Peter Bloem, Ilia Markov, Piek Vossen

But no resources exist to evaluate how well Large Language Models can use explicit reasoning to resolve ambiguity in language.

Cross-Domain Toxic Spans Detection

1 code implementation16 Jun 2023 Stefan F. Schouten, Baran Barbarestani, Wondimagegnhue Tufa, Piek Vossen, Ilia Markov

Given the dynamic nature of toxic language use, automated methods for detecting toxic spans are likely to encounter distributional shift.

Toxic Spans Detection

A Machine with Short-Term, Episodic, and Semantic Memory Systems

1 code implementation5 Dec 2022 Taewoon Kim, Michael Cochez, Vincent François-Lavet, Mark Neerincx, Piek Vossen

Inspired by the cognitive science theory of the explicit human memory systems, we have modeled an agent with short-term, episodic, and semantic memory systems, each of which is modeled with a knowledge graph.

Q-Learning Reinforcement Learning (RL)

Evaluating Agent Interactions Through Episodic Knowledge Graphs

1 code implementation CCGPK (COLING) 2022 Selene Báez Santamaría, Piek Vossen, Thomas Baier

We present a new method based on episodic Knowledge Graphs (eKGs) for evaluating (multimodal) conversational agents in open domains.

Knowledge Graphs

A modular architecture for creating multimodal agents

no code implementations1 Jun 2022 Thomas Baier, Selene Baez Santamaria, Piek Vossen

The paper describes a flexible and modular platform to create multimodal interactive agents.

A Machine With Human-Like Memory Systems

1 code implementation4 Apr 2022 Taewoon Kim, Michael Cochez, Vincent Francois-Lavet, Mark Neerincx, Piek Vossen

Inspired by the cognitive science theory, we explicitly model an agent with both semantic and episodic memory systems, and show that it is better than having just one of the two memory systems.

OpenAI Gym

EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa

1 code implementation26 Aug 2021 Taewoon Kim, Piek Vossen

We present EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa, a simple yet expressive scheme of solving the ERC (emotion recognition in conversation) task.

Emotion Recognition in Conversation

Would you describe a leopard as yellow? Evaluating crowd-annotations with justified and informative disagreement

no code implementations COLING 2020 Pia Sommerauer, Antske Fokkens, Piek Vossen

We establish an additional, agreement-independent quality metric based on answer-coherence and evaluate it in comparison to existing metrics.

valid World Knowledge

Annotating Perspectives on Vaccination

no code implementations LREC 2020 Roser Morante, Chantal van Son, Isa Maks, Piek Vossen

In this paper we present the Vaccination Corpus, a corpus of texts related to the online vaccination debate that has been annotated with three layers of information about perspectives: attribution, claims and opinions.

Large-scale Cross-lingual Language Resources for Referencing and Framing

no code implementations LREC 2020 Piek Vossen, Filip Ilievski, Marten Postma, Antske Fokkens, Gosse Minnema, Levi Remijnse

In this article, we lay out the basic ideas and principles of the project Framing Situations in the Dutch Language.

Combining Conceptual and Referential Annotation to Study Variation in Framing

no code implementations LREC 2020 Marten Postma, Levi Remijnse, Filip Ilievski, Antske Fokkens, Sam Titarsolej, Piek Vossen

The user can apply two types of annotations: 1) mappings from expressions to frames and frame elements, 2) reference relations from mentions to events and participants of the structured data.

Talking about other people: an endless range of possibilities

1 code implementation WS 2018 Emiel van Miltenburg, Desmond Elliott, Piek Vossen

This taxonomy serves as a reference point to think about how other people should be described, and can be used to classify and compute statistics about labels applied to people.

Text Generation

The Profiling Machine: Active Generalization over Knowledge

no code implementations1 Oct 2018 Filip Ilievski, Eduard Hovy, Qizhe Xie, Piek Vossen

The human mind is a powerful multifunctional knowledge storage and management system that performs generalization, type inference, anomaly detection, stereotyping, and other tasks.

Anomaly Detection Management

Systematic Study of Long Tail Phenomena in Entity Linking

no code implementations COLING 2018 Filip Ilievski, Piek Vossen, Stefan Schlobach

In this paper we report on a series of hypotheses regarding the long tail phenomena in entity linking datasets, their interaction, and their impact on system performance.

Entity Linking

Measuring the Diversity of Automatic Image Descriptions

1 code implementation COLING 2018 Emiel van Miltenburg, Desmond Elliott, Piek Vossen

Automatic image description systems typically produce generic sentences that only make use of a small subset of the vocabulary available to them.

Text Generation

NewsReader at SemEval-2018 Task 5: Counting events by reasoning over event-centric-knowledge-graphs

no code implementations SEMEVAL 2018 Piek Vossen

In this paper, we describe the participation of the NewsReader system in the SemEval-2018 Task 5 on Counting Events and Participants in the Long Tail.

Knowledge Graphs

SemEval-2018 Task 5: Counting Events and Participants in the Long Tail

no code implementations SEMEVAL 2018 Marten Postma, Filip Ilievski, Piek Vossen

This paper discusses SemEval-2018 Task 5: a referential quantification task of counting events and participants in local, long-tail news documents with high ambiguity.

Word Sense Disambiguation

BiographyNet: Extracting Relations Between People and Events

no code implementations22 Jan 2018 Antske Fokkens, Serge ter Braake, Niels Ockeloen, Piek Vossen, Susan Legêne, Guus Schreiber, Victor de Boer

This paper describes BiographyNet, a digital humanities project (2012-2016) that brings together researchers from history, computational linguistics and computer science.

Management

Storyteller: Visual Analytics of Perspectives on Rich Text Interpretations

no code implementations WS 2017 Maarten van Meersbergen, Piek Vossen, Janneke van der Zwaan, Antske Fokkens, Willem van Hage, Inger Leemans, Isa Maks

Complexity of event data in texts makes it difficult to assess its content, especially when considering larger collections in which different sources report on the same or similar situations.

GRaSP: Grounded Representation and Source Perspective

1 code implementation RANLP 2017 Antske Fokkens, Piek Vossen, Marco Rospocher, Rinke Hoekstra, Willem Robert van Hage

When people or organizations provide information, they make choices regarding what information they include and how they present it.

The Circumstantial Event Ontology (CEO)

no code implementations WS 2017 Roxane Segers, Tommaso Caselli, Piek Vossen

In this paper we describe the ongoing work on the Circumstantial Event Ontology (CEO), a newly developed ontology for calamity events that models semantic circumstantial relations between event classes.

The Event StoryLine Corpus: A New Benchmark for Causal and Temporal Relation Extraction

no code implementations WS 2017 Tommaso Caselli, Piek Vossen

This paper reports on the Event StoryLine Corpus (ESC) v1. 0, a new benchmark dataset for the temporal and causal relation detection.

Natural Language Inference Question Answering +2

Cross-linguistic differences and similarities in image descriptions

1 code implementation WS 2017 Emiel van Miltenburg, Desmond Elliott, Piek Vossen

Automatic image description systems are commonly trained and evaluated on large image description datasets.

Specificity

Identity and Granularity of Events in Text

no code implementations13 Apr 2017 Piek Vossen, Agata Cybulska

In this paper we describe a method to detect event descrip- tions in different news articles and to model the semantics of events and their components using RDF representations.

Event Detection

Semantic overfitting: what `world' do we consider when evaluating disambiguation of text?

no code implementations COLING 2016 Filip Ilievski, Marten Postma, Piek Vossen

Semantic text processing faces the challenge of defining the relation between lexical expressions and the world to which they make reference within a period of time.

Relation

More is not always better: balancing sense distributions for all-words Word Sense Disambiguation

1 code implementation COLING 2016 Marten Postma, Ruben Izquierdo Bevia, Piek Vossen

Current Word Sense Disambiguation systems show an extremely poor performance on low frequent senses, which is mainly caused by the difference in sense distributions between training and test data.

Word Sense Disambiguation

GRaSP: A Multilayered Annotation Scheme for Perspectives

no code implementations LREC 2016 Chantal van Son, Tommaso Caselli, Antske Fokkens, Isa Maks, Roser Morante, Lora Aroyo, Piek Vossen

In the last decade, different aspects of linguistic encoding of perspectives have been targeted as separated phenomena through different annotation initiatives.

Relation

The Event and Implied Situation Ontology (ESO): Application and Evaluation

no code implementations LREC 2016 Roxane Segers, Marco Rospocher, Piek Vossen, Egoitz Laparra, German Rigau, Anne-Lyse Minard

This paper presents the Event and Implied Situation Ontology (ESO), a manually constructed resource which formalizes the pre and post situations of events and the roles of the entities affected by an event.

Addressing the MFS Bias in WSD systems

no code implementations LREC 2016 Marten Postma, Ruben Izquierdo, Eneko Agirre, German Rigau, Piek Vossen

Word Sense Disambiguation (WSD) systems tend to have a strong bias towards assigning the Most Frequent Sense (MFS), which results in high performance on the MFS but in a very low performance on the less frequent senses.

Word Sense Disambiguation

NewsReader: recording history from daily news streams

no code implementations LREC 2014 Piek Vossen, German Rigau, Luciano Serafini, Pim Stouten, Francis Irving, Willem van Hage

The European project NewsReader develops technology to process daily news streams in 4 languages, extracting what happened, when, where and who was involved.

Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution

no code implementations LREC 2014 Agata Cybulska, Piek Vossen

In this paper we examine the representativeness of the EventCorefBank (ECB, Bejan and Harabagiu, 2010) with regards to the language population of large-volume streams of news.

coreference-resolution Event Coreference Resolution

Discovering and Visualising Stories in News

no code implementations LREC 2014 Marieke van Erp, Gleb Satyukov, Piek Vossen, Marit Nijsen

Daily news streams often revolve around topics that span over a longer period of time such as the global financial crisis or the healthcare debate in the US.

Hope and Fear: How Opinions Influence Factuality

no code implementations LREC 2014 Chantal van Son, Marieke van Erp, Antske Fokkens, Piek Vossen

Both sentiment and event factuality are fundamental information levels for our understanding of events mentioned in news texts.

Opinion Mining Sentiment Analysis

Generating Polarity Lexicons with WordNet propagation in 5 languages

no code implementations LREC 2014 Isa Maks, Ruben Izquierdo, Francesca Frontini, Rodrigo Agerri, Piek Vossen, Andoni Azpeitia

In this paper we focus on the creation of general-purpose (as opposed to domain-specific) polarity lexicons in five languages: French, Italian, Dutch, English and Spanish using WordNet propagation.

Named Entity Recognition (NER) Opinion Mining +1

Building a fine-grained subjectivity lexicon from a web corpus

no code implementations LREC 2012 Isa Maks, Piek Vossen

In this paper we propose a method to build fine-grained subjectivity lexicons including nouns, verbs and adjectives.

Sentiment Analysis Subjectivity Analysis

DutchSemCor: Targeting the ideal sense-tagged corpus

no code implementations LREC 2012 Piek Vossen, Attila G{\"o}r{\"o}g, Rub{\'e}n Izquierdo, Antal Van den Bosch

Word Sense Disambiguation (WSD) systems require large sense-tagged corpora along with lexical databases to reach satisfactory results.

Active Learning Word Sense Disambiguation

Cannot find the paper you are looking for? You can Submit a new open access paper.