no code implementations • SIGDIAL (ACL) 2021 • Shamila Nasreen, Julian Hough, Matthew Purver
Alzheimer’s Disease (AD) is associated with many characteristic changes, not only in an individual’s language but also in the interactive patterns observed in dialogue.
no code implementations • EACL (Hackashop) 2021 • Andraž Pelicon, Ravi Shekhar, Matej Martinc, Blaž Škrlj, Matthew Purver, Senja Pollak
We present a system for zero-shot cross-lingual offensive language and hate speech classification.
no code implementations • ReInAct 2021 • Jorge Del-Bosque-Trevino, Julian Hough, Matthew Purver
We annotate a corpus of analogical episodes with the schema and develop statistical sequence models from the corpus which predict tutor content related decisions, in terms of the selection of the analogical component (AC) and tutor conversational management act (TCMA) to deploy at the current utterance, given the student’s behaviour.
no code implementations • SIGDIAL (ACL) 2021 • Mladen Karan, Prashant Khare, Patrick Healey, Matthew Purver
This work revisits the task of detecting decision-related utterances in multi-party dialogue.
no code implementations • EACL (Hackashop) 2021 • Senja Pollak, Marko Robnik-Šikonja, Matthew Purver, Michele Boggia, Ravi Shekhar, Marko Pranjić, Salla Salmela, Ivar Krustok, Tarmo Paju, Carl-Gustav Linden, Leo Leppänen, Elaine Zosa, Matej Ulčar, Linda Freienthal, Silver Traat, Luis Adrián Cabrera-Diego, Matej Martinc, Nada Lavrač, Blaž Škrlj, Martin Žnidaršič, Andraž Pelicon, Boshko Koloski, Vid Podpečan, Janez Kranjc, Shane Sheehan, Emanuela Boros, Jose G. Moreno, Antoine Doucet, Hannu Toivonen
This paper presents tools and data sources collected and released by the EMBEDDIA project, supported by the European Union’s Horizon 2020 research and innovation program.
1 code implementation • SemEval (NAACL) 2022 • Thi Hong Hanh Tran, Matej Martinc, Matthew Purver, Senja Pollak
The reverse dictionary task is a sequence-to-vector task in which a gloss is provided as input, and the output must be a semantically matching word vector.
no code implementations • CSRNLP (LREC) 2022 • Matthew Purver, Matej Martinc, Riste Ichev, Igor Lončarski, Katarina Sitar Šuštar, Aljoša Valentinčič, Senja Pollak
We describe initial work into analysing the language used around environmental, social and governance (ESG) issues in UK company annual reports.
no code implementations • 22 Jan 2025 • Zahraa Al Sahili, Ioannis Patras, Matthew Purver
As large-scale vision-language models (VLMs) become increasingly central to modern AI applications, understanding and mitigating social biases in these systems has never been more critical. We investigate how dataset composition, model size, and multilingual training affect gender and racial bias in a popular VLM, CLIP, and its open-source variants.
no code implementations • 25 Nov 2024 • Iacopo Ghinassi, Lin Wang, Chris Newell, Matthew Purver
In this survey, we provide an extensive overview of current advances in linear text segmentation, describing the state of the art in terms of resources and approaches for the task.
no code implementations • 30 Sep 2024 • Luka Andrenšek, Boshko Koloski, Andraž Pelicon, Nada Lavrač, Senja Pollak, Matthew Purver
We investigate zero-shot cross-lingual news sentiment detection, aiming to develop robust sentiment classifiers that can be deployed across multiple languages without target-language training data.
1 code implementation • 9 Sep 2024 • Yujian Gan, Changling Li, Jinxia Xie, Luou Wen, Matthew Purver, Massimo Poesio
The benchmark includes 31 different task types, each with 10 unique dialogue scenarios between information seeker and provider agents.
no code implementations • 3 Aug 2024 • Peyman Hosseini, Ignacio Castro, Iacopo Ghinassi, Matthew Purver
Large Language Models (LLMs) have demonstrated remarkable capabilities in comprehending and analyzing lengthy sequential inputs, owing to their extensive context windows that allow processing millions of tokens in a single forward pass.
no code implementations • 23 Jul 2024 • Zahraa Al Sahili, Ioannis Patras, Matthew Purver
The application of machine learning (ML) in detecting, diagnosing, and treating mental health disorders is garnering increasing attention.
no code implementations • 13 Jun 2024 • Zahraa Al Sahili, Ioannis Patras, Matthew Purver
In the domain of text-to-image generative models, biases inherent in training datasets often propagate into generated content, posing significant ethical challenges, particularly in socially sensitive contexts.
no code implementations • 10 Apr 2024 • Jaya Caporusso, Damar Hoogland, Mojca Brglez, Boshko Koloski, Matthew Purver, Senja Pollak
Dehumanisation involves the perception and or treatment of a social group's members as less than human.
no code implementations • 15 Oct 2023 • Dimitris Gkoumas, Matthew Purver, Maria Liakata
Here, we automatically learn linguistic disorder patterns by making use of a moderately-sized pre-trained language model and forcing it to focus on reformulated natural language processing (NLP) tasks and associated linguistic patterns.
1 code implementation • 4 Mar 2023 • Peyman Hosseini, Mehran Hosseini, Sana Sabah Al-Azzawi, Marcus Liwicki, Ignacio Castro, Matthew Purver
We study the influence of different activation functions in the output layer of deep neural network models for soft and hard label prediction in the learning with disagreement task.
no code implementations • 11 Nov 2022 • Ravi Shekhar, Mladen Karan, Matthew Purver
In light of unprecedented increases in the popularity of the internet and social media, comment moderation has never been a more relevant task.
no code implementations • LREC 2022 • Pakawat Nakwijit, Matthew Purver
User-generated content is full of misspellings.
1 code implementation • Findings (NAACL) 2022 • Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver
To deal with this problem, we modify a number of state-of-the-art models to train on the segmented data of Spider-SS, and we show that this method improves the generalization performance.
1 code implementation • RANLP 2021 • Elaine Zosa, Ravi Shekhar, Mladen Karan, Matthew Purver
Moderation of reader comments is a significant problem for online news platforms.
1 code implementation • EMNLP 2021 • Yujian Gan, Xinyun Chen, Matthew Purver
Recently, there has been significant progress in studying neural networks for translating text descriptions into SQL queries under the zero-shot cross-domain setting.
3 code implementations • Findings (EMNLP) 2021 • Yujian Gan, Xinyun Chen, Jinxia Xie, Matthew Purver, John R. Woodward, John Drake, Qiaofu Zhang
Addressing the mismatch between natural language descriptions and the corresponding SQL queries is a key challenge for text-to-SQL translation.
no code implementations • 3 Sep 2021 • Dimitris Gkoumas, Bo wang, Adam Tsakalidis, Maria Wolters, Arkaitz Zubiaga, Matthew Purver, Maria Liakata
The corpus consists of spoken conversations, a subset of which are transcribed, as well as typed and written thoughts and associated extra-linguistic information such as pen strokes and keystrokes.
no code implementations • 22 Jul 2021 • Matej Ulčar, Aleš Žagar, Carlos S. Armendariz, Andraž Repar, Senja Pollak, Matthew Purver, Marko Robnik-Šikonja
The current dominance of deep neural networks in natural language processing is based on contextual embeddings such as ELMo, BERT, and BERT derivatives.
no code implementations • 29 Jun 2021 • Morteza Rohanian, Julian Hough, Matthew Purver
We present two multimodal fusion-based deep learning models that consume ASR transcribed speech and acoustic data simultaneously to classify whether a speaker in a structured diagnostic task has Alzheimer's Disease and to what degree, evaluating the ADReSSo challenge 2021 data.
1 code implementation • 17 Jun 2021 • Morteza Rohanian, Julian Hough, Matthew Purver
This paper is a submission to the Alzheimer's Dementia Recognition through Spontaneous Speech (ADReSS) challenge, which aims to develop methods that can assist in the automated prediction of severity of Alzheimer's Disease from speech data.
1 code implementation • ACL 2021 • Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver, John R. Woodward, Jinxia Xie, Pengsheng Huang
We observe that the accuracy dramatically drops by eliminating such explicit correspondence between NL questions and table schemas, even if the synonyms are not adversarially selected to conduct worst-case adversarial attacks.
no code implementations • SEMEVAL 2020 • Carlos Santos Armendariz, Matthew Purver, Senja Pollak, Nikola Ljube{\v{s}}i{\'c}, Matej Ul{\v{c}}ar, Ivan Vuli{\'c}, Mohammad Taher Pilehvar
This paper presents the Graded Word Similarity in Context (GWSC) task which asked participants to predict the effects of context on human perception of similarity in English, Croatian, Slovene and Finnish.
no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Yujian Gan, Matthew Purver, John R. Woodward
WikiSQL and Spider, the large-scale cross-domain text-to-SQL datasets, have attracted much attention from the research community.
no code implementations • EMNLP (NLP-COVID19) 2020 • Tom Tabak, Matthew Purver
We describe a set of experiments for building a temporal mental health dynamics system.
1 code implementation • 2 Apr 2020 • Jey Han Lau, Carlos S. Armendariz, Shalom Lappin, Matthew Purver, Chang Shu
We study the influence of context on sentence acceptability.
no code implementations • TACL 2020 • Jey Han Lau, Carlos Armendariz, Shalom Lappin, Matthew Purver, Chang Shu
We study the influence of context on sentence acceptability.
1 code implementation • LREC 2020 • Carlos Santos Armendariz, Matthew Purver, Matej Ulčar, Senja Pollak, Nikola Ljubešić, Marko Robnik-Šikonja, Mark Granroth-Wilding, Kristiina Vaik
State of the art natural language processing tools are built on context-dependent word embeddings, but no direct method for evaluating these representations currently exists.
no code implementations • 1 Nov 2018 • Mehrnoosh Sadrzadeh, Matthew Purver, Julian Hough, Ruth Kempson
One of the fundamental requirements for models of semantic processing in dialogue is incrementality: a model must reflect how people interpret and generate language at least on a word-by-word basis, and handle phenomena such as fragments, incomplete and jointly-produced utterances.
no code implementations • WS 2017 • Sophie Chesney, Maria Liakata, Massimo Poesio, Matthew Purver
This paper discusses the problem of incongruent headlines: those which do not accurately represent the information contained in the article with which they occur.
no code implementations • 4 Aug 2016 • Stephen McGregor, Matthew Purver, Geraint Wiggins
This paper presents a geometric approach to the problem of modelling the relationship between words and concepts, focusing in particular on analogical phenomena in language and cognition.
no code implementations • EMNLP 2014 • Julian Hough, Matthew Purver
We present STIR (STrongly Incremental Repair detection), a system that detects speech repairs and edit terms on transcripts incrementally with minimal latency.
no code implementations • EMNLP 2014 • Dmitrijs Milajevs, Dimitri Kartsaklis, Mehrnoosh Sadrzadeh, Matthew Purver
We provide a comparative study between neural word representations and traditional vector spaces based on co-occurrence counts, in a number of compositional tasks.