no code implementations • 29 Sep 2024 • Shahed Masoudian, Markus Frohmann, Navid Rekabsaz, Markus Schedl
Language models frequently inherit societal biases from their training data.
1 code implementation • 29 Jan 2024 • Shahed Masoudian, Cornelia Volaucnik, Markus Schedl, Navid Rekabsaz
Bias mitigation of Language Models has been the topic of many studies with a recent focus on learning separate modules like adapters for on-demand debiasing.
1 code implementation • 23 Jan 2024 • Carolin Holtermann, Markus Frohmann, Navid Rekabsaz, Anne Lauscher
The knowledge encapsulated in a model is the core factor determining its final performance on downstream tasks.
1 code implementation • 2 Oct 2023 • Markus Frohmann, Carolin Holtermann, Shahed Masoudian, Anne Lauscher, Navid Rekabsaz
We introduce ScaLearn, a simple and highly parameter-efficient two-stage MTL method that capitalizes on the knowledge of the source tasks by learning a minimal set of scaling parameters that enable effective transfer to a target task.
1 code implementation • 13 Jun 2023 • Shahed Masoudian, Khaled Koutini, Markus Schedl, Gerhard Widmer, Navid Rekabsaz
In the Acoustic Scene Classification task (ASC), domain shift is mainly caused by different recording devices.
1 code implementation • 26 May 2023 • Nicolò Tamagnone, Selim Fekih, Ximena Contla, Nayid Orozco, Navid Rekabsaz
Accurate and rapid situation analysis during humanitarian crises is critical to delivering humanitarian aid efficiently and is fundamental to humanitarian imperatives and the Leave No One Behind (LNOB) principle.
1 code implementation • 25 May 2023 • George Zerveas, Navid Rekabsaz, Carsten Eickhoff
Sparse annotation poses persistent challenges to training dense retrieval models; for example, it distorts the training signal when unlabeled relevant documents are used spuriously as negatives in contrastive learning.
1 code implementation • 13 Feb 2023 • Deepak Kumar, Oleg Lesota, George Zerveas, Daniel Cohen, Carsten Eickhoff, Markus Schedl, Navid Rekabsaz
Large pre-trained language models contain societal biases and carry along these biases to downstream tasks.
no code implementations • 17 Jan 2023 • Christopher Wimmer, Navid Rekabsaz
Predicting future direction of stock markets using the historical data has been a fundamental component in financial forecasting.
1 code implementation • 10 Oct 2022 • Selim Fekih, Nicolò Tamagnone, Benjamin Minixhofer, Ranjan Shrestha, Ximena Contla, Ewan Oglethorpe, Navid Rekabsaz
Timely and effective response to humanitarian crises requires quick and accurate analysis of large amounts of text data - a process that can highly benefit from expert-assisted NLP systems trained on validated and annotated data in the humanitarian response domain.
1 code implementation • 9 Jun 2022 • Christian Ganhör, David Penz, Navid Rekabsaz, Oleg Lesota, Markus Schedl
We conduct experiments on the MovieLens-1M and LFM-2b-DemoBias datasets, and evaluate the effectiveness of the bias mitigation method based on the inability of external attackers in revealing the users' gender information from the model.
1 code implementation • 30 May 2022 • Lukas Hauzenberger, Shahed Masoudian, Deepak Kumar, Markus Schedl, Navid Rekabsaz
Societal biases are reflected in large pre-trained language models and their fine-tuned versions on downstream tasks.
no code implementations • 3 Mar 2022 • Klara Krieg, Emilia Parada-Cabaleiro, Markus Schedl, Navid Rekabsaz
This work investigates the effect of gender-stereotypical biases in the content of retrieved results on the relevance judgement of users/annotators.
1 code implementation • 19 Jan 2022 • Klara Krieg, Emilia Parada-Cabaleiro, Gertraud Medicus, Oleg Lesota, Markus Schedl, Navid Rekabsaz
To facilitate the studies of gender bias in the retrieval results of IR systems, we introduce Gender Representation-Bias for Information Retrieval (Grep-BiasIR), a novel thoroughly-audited dataset consisting of 118 bias-sensitive neutral search queries.
1 code implementation • 16 Dec 2021 • George Zerveas, Navid Rekabsaz, Daniel Cohen, Carsten Eickhoff
Contrastive learning has been the dominant approach to training dense retrieval models.
1 code implementation • NAACL 2022 • Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz
Our method makes training large language models for new languages more accessible and less damaging to the environment.
no code implementations • 16 Aug 2021 • Oleg Lesota, Alessandro B. Melchiorre, Navid Rekabsaz, Stefan Brandl, Dominik Kowald, Elisabeth Lex, Markus Schedl
In this work, in contrast, we propose to investigate popularity differences (between the user profile and recommendation list) in terms of median, a variety of statistical moments, as well as similarity measures that consider the entire popularity distributions (Kullback-Leibler divergence and Kendall's tau rank-order correlation).
1 code implementation • 25 Jun 2021 • Oleg Lesota, Navid Rekabsaz, Daniel Cohen, Klaus Antonius Grasserbauer, Carsten Eickhoff, Markus Schedl
In contrast to the matching paradigm, the probabilistic nature of generative rankers readily offers a fine-grained measure of uncertainty.
1 code implementation • 10 May 2021 • Daniel Cohen, Bhaskar Mitra, Oleg Lesota, Navid Rekabsaz, Carsten Eickhoff
In any ranking system, the retrieval model outputs a single score for a document based on its belief on how relevant it is to a given search query.
1 code implementation • 28 Apr 2021 • Navid Rekabsaz, Simone Kopeinik, Markus Schedl
In this work, we first provide a novel framework to measure the fairness in the retrieved text contents of ranking models.
no code implementations • EACL 2021 • Jenny Paola Yela-Bello, Ewan Oglethorpe, Navid Rekabsaz
To improve this process, effective summarization models are a valuable tool for humanitarian response experts as they provide digestible overviews of essential information in secondary data.
1 code implementation • 14 Mar 2021 • Navid Rekabsaz, Oleg Lesota, Markus Schedl, Jon Brassey, Carsten Eickhoff
As such, the collection is one of the few datasets offering the necessary data richness and scale to train neural IR models with a large amount of parameters, and notably the first in the health domain.
1 code implementation • 1 May 2020 • Navid Rekabsaz, Markus Schedl
Concerns regarding the footprint of societal biases in information retrieval (IR) systems have been raised in several previous studies.
no code implementations • 15 Jan 2020 • Markus Zlabinger, Sebastian Hofstätter, Navid Rekabsaz, Allan Hanbury
While existing disease-symptom relationship extraction methods are used as the foundation in the various medical tasks, no collection is available to systematically evaluate the performance of such methods.
no code implementations • 29 May 2019 • Navid Rekabsaz, Nikolaos Pappas, James Henderson, Banriskhem K. Khonglah, Srikanth Madikeri
In this study, we propose a multilingual neural language model architecture, trained jointly on the domain-specific data of several low-resource languages.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+6
1 code implementation • 29 Apr 2019 • Sebastian Hofstätter, Navid Rekabsaz, Carsten Eickhoff, Allan Hanbury
Low-frequency terms are a recurring challenge for information retrieval models, especially neural IR frameworks struggle with adequately capturing infrequently observed words.
no code implementations • 13 Dec 2018 • Navid Rekabsaz, Robert West, James Henderson, Allan Hanbury
The common approach to measuring such biases using a corpus is by calculating the similarities between the embedding vector of a word (like nurse) and the vectors of the representative words of the concepts of interest (such as genders).
no code implementations • 16 Nov 2017 • Navid Rekabsaz, Mihai Lupu, Allan Hanbury, Andres Duque
We explore the use of unsupervised methods in Cross-Lingual Word Sense Disambiguation (CL-WSD) with the application of English to Persian.
no code implementations • 20 Jul 2017 • Navid Rekabsaz, Bhaskar Mitra, Mihai Lupu, Allan Hanbury
As an alternative, explicit word representations propose vectors whose dimensions are easily interpretable, and recent methods show competitive performance to the dense vectors.
no code implementations • 20 Jun 2016 • Navid Rekabsaz, Mihai Lupu, Allan Hanbury
Word embedding, specially with its recent developments, promises a quantification of the similarity between terms.
1 code implementation • LREC 2016 • Navid Rekabsaz, Serwah Sabetghadam, Mihai Lupu, Linda Andersson, Allan Hanbury
In this paper, we address the shortage of evaluation benchmarks on Persian (Farsi) language by creating and making available a new benchmark for English to Persian Cross Lingual Word Sense Disambiguation (CL-WSD).