no code implementations • SemEval (NAACL) 2022 • Wessel Poelman, Gijs Danoe, Esther Ploeger, Frank van den Berg, Tommaso Caselli, Lukas Edman
This paper describes our system created for the SemEval 2022 Task 3: Presupposed Taxonomies - Evaluating Neural-network Semantics.
1 code implementation • COLING 2022 • Wessel Poelman, Rik van Noord, Johan Bos
Even though many recent semantic parsers are based on deep learning methods, we should not forget that rule-based alternatives might offer advantages over neural approaches with respect to transparency, portability, and explainability.
no code implementations • 11 Dec 2024 • Wessel Poelman, Miryam de Lhoneux
In this position paper, we lay out two roles of English in multilingual LM evaluations: as an interface and as a natural language.
no code implementations • 8 Nov 2024 • Kushal Tatariya, Artur Kulmizev, Wessel Poelman, Esther Ploeger, Marcel Bollmann, Johannes Bjerva, Jiaming Luo, Heather Lent, Miryam de Lhoneux
Wikipedia's perceived high quality and broad language coverage have established it as a fundamental resource in multilingual NLP.
1 code implementation • 6 Jul 2024 • Esther Ploeger, Wessel Poelman, Andreas Holck Høeg-Petersen, Anders Schlichtkrull, Miryam de Lhoneux, Johannes Bjerva
We compare sampling methods with a range of metrics and find that our systematic methods consistently retrieve more typologically diverse language selections than previous methods in NLP.
no code implementations • 1 Jul 2024 • Phillip Schneider, Wessel Poelman, Michael Rovatsos, Florian Matthes
Conversational search systems enable information retrieval via natural language interactions, with the goal of maximizing users' information gain over multiple dialogue turns.
2 code implementations • 6 Feb 2024 • Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva
We recommend future work to include an operationalization of 'typological diversity' that empirically justifies the diversity of language samples.
no code implementations • 14 Sep 2023 • Mahdi Dhaini, Wessel Poelman, Ege Erdogan
While recent advancements in the capabilities and widespread accessibility of generative language models, such as ChatGPT (OpenAI, 2022), have brought about various benefits by generating fluent human-like text, the task of distinguishing between human- and large language model (LLM) generated text has emerged as a crucial problem.