Search Results for author: Benjamin Roth

Found 48 papers, 22 papers with code

Knodle: Modular Weakly Supervised Learning with PyTorch

1 code implementation ACL (RepL4NLP) 2021 Anastasiia Sedova, Andreas Stephan, Marina Speranskaya, Benjamin Roth

Strategies for improving the training and prediction quality of weakly supervised machine learning models vary in how much they are tailored to a specific task or integrated with a specific model architecture.

Benchmarking BIG-bench Machine Learning +3

ULF: Unsupervised Labeling Function Correction using Cross-Validation for Weak Supervision

1 code implementation14 Apr 2022 Anastasiia Sedova, Benjamin Roth

A cost-effective alternative to manual data labeling is weak supervision (WS), where data samples are automatically annotated using a predefined set of labeling functions (LFs), rule-based mechanisms that generate artificial labels for the associated classes.

Denoising Weakly-supervised Learning

Position-aware Self-attention with Relative Positional Encodings for Slot Filling

1 code implementation9 Jul 2018 Ivan Bilan, Benjamin Roth

The self-attention encoder also uses a custom implementation of relative positional encodings which allow each word in the sentence to take into account its left and right context.

Position Relation Extraction +3

Multilingual Relation Extraction using Compositional Universal Schema

1 code implementation NAACL 2016 Patrick Verga, David Belanger, Emma Strubell, Benjamin Roth, Andrew McCallum

In response, this paper introduces significant further improvements to the coverage and flexibility of universal schema relation extraction: predictions for entities unseen in training and multilingual transfer learning to domains with no annotation.

Relation Relation Extraction +4

KnowMAN: Weakly Supervised Multinomial Adversarial Networks

1 code implementation EMNLP 2021 Luisa März, Ehsaneddin Asgari, Fabienne Braune, Franziska Zimmermann, Benjamin Roth

The knowledge is captured in labeling functions, which detect certain regularities or patterns in the training samples and annotate corresponding labels for training.

Language Modelling Weakly-supervised Learning

Interpretable Textual Neuron Representations for NLP

2 code implementations WS 2018 Nina Poerner, Benjamin Roth, Hinrich Schütze

Input optimization methods, such as Google Deep Dream, create interpretable representations of neurons for computer vision DNNs.

Text-Guided Image Clustering

1 code implementation5 Feb 2024 Andreas Stephan, Lukas Miklautz, Kevin Sidak, Jan Philip Wahle, Bela Gipp, Claudia Plant, Benjamin Roth

We, therefore, propose Text-Guided Image Clustering, i. e., generating text using image captioning and visual question-answering (VQA) models and subsequently clustering the generated text.

Clustering Image Captioning +3

Ranking vs. Classifying: Measuring Knowledge Base Completion Quality

1 code implementation AKBC 2020 Marina Speranskaya, Martin Schmitt, Benjamin Roth

We randomly remove some of these correct answers from the data set, simulating the realistic scenario of real-world entities missing from a KB.

Knowledge Base Completion Model Selection

ACTC: Active Threshold Calibration for Cold-Start Knowledge Graph Completion

1 code implementation10 May 2023 Anastasiia Sedova, Benjamin Roth

In this paper, we attempt for the first time cold-start calibration for KGC, where no annotated examples exist initially for calibration, and only a limited number of tuples can be selected for annotation.

Knowledge Graph Completion Relation

MemeGraphs: Linking Memes to Knowledge Graphs

1 code implementation28 May 2023 Vasiliki Kougia, Simon Fetzel, Thomas Kirchmair, Erion Çano, Sina Moayed Baharlou, Sahand Sharifzadeh, Benjamin Roth

In this work, we propose to use scene graphs, that express images in terms of objects and their visual relations, and knowledge graphs as structured representations for meme classification with a Transformer-based architecture.

Entity Linking Knowledge Graphs +1

Learning with Noisy Labels by Adaptive Gradient-Based Outlier Removal

1 code implementation7 Jun 2023 Anastasiia Sedova, Lena Zellinger, Benjamin Roth

Instead of cleaning the dataset prior to model training, the dataset is dynamically adjusted during the training process.

Denoising Learning with noisy labels

WeaNF: Weak Supervision with Normalizing Flows

2 code implementations28 Apr 2022 Andreas Stephan, Benjamin Roth

In this work, we explore a novel direction of generative modeling for weak supervision: Instead of modeling the output of the annotation process (the labeling function matches), we generatively model the input-side data distributions (the feature space) covered by labeling functions.

XPASC: Measuring Generalization in Weak Supervision by Explainability and Association

1 code implementation3 Jun 2022 Luisa März, Ehsaneddin Asgari, Fabienne Braune, Franziska Zimmermann, Benjamin Roth

To verify this assumption, we introduce a novel method, XPASC (eXPlainability-Association SCore), for measuring the generalization of a model trained with a weakly supervised dataset.

ReInform: Selecting paths with reinforcement learning for contextualized link prediction

1 code implementation19 Nov 2022 Marina Speranskaya, Sameh Methias, Benjamin Roth

We propose to use reinforcement learning to inform transformer-based contextualized link prediction models by providing paths that are most useful for predicting the correct answer.

Link Prediction reinforcement-learning +1

Counterfactual Reasoning with Knowledge Graph Embeddings

1 code implementation11 Mar 2024 Lena Zellinger, Andreas Stephan, Benjamin Roth

We further observe that KGEs adapted with COULDD solidly detect plausible counterfactual changes to the graph that follow these patterns.

counterfactual Counterfactual Reasoning +2

Neural Architectures for Open-Type Relation Argument Extraction

no code implementations5 Mar 2018 Benjamin Roth, Costanza Conforti, Nina Poerner, Sanjeev Karn, Hinrich Schütze

In this work, we introduce the task of Open-Type Relation Argument Extraction (ORAE): Given a corpus, a query entity Q and a knowledge base relation (e. g.,"Q authored notable work with title X"), the model has to extract an argument of non-standard entity type (entities that cannot be extracted by a standard named entity tagger, e. g. X: the title of a book or a work of art) from the corpus.

Question Answering Relation +2

Comparing Convolutional Neural Networks to Traditional Models for Slot Filling

no code implementations NAACL 2016 Heike Adel, Benjamin Roth, Hinrich Schütze

We address relation classification in the context of slot filling, the task of finding and evaluating fillers like "Steve Jobs" for the slot X in "X founded Apple".

Classification General Classification +5

Compositional Vector Space Models for Knowledge Base Completion

no code implementations IJCNLP 2015 Arvind Neelakantan, Benjamin Roth, Andrew McCallum

Knowledge base (KB) completion adds new facts to a KB by making inferences from existing facts, for example by inferring with high likelihood nationality(X, Y) from bornIn(X, Y).

Knowledge Base Completion Relation +1

Assessing Wikipedia-Based Cross-Language Retrieval Models

no code implementations10 Jan 2014 Benjamin Roth

For a combined model, another interesting question is therefore how to integrate different weighting schemes.

Language Modelling Machine Translation +2

Effective Slot Filling Based on Shallow Distant Supervision Methods

no code implementations6 Jan 2014 Benjamin Roth, Tassilo Barth, Michael Wiegand, Mittul Singh, Dietrich Klakow

In the TAC KBP 2013 English Slotfilling evaluation, the submitted main run of the LSV RelationFactory system achieved the top-ranked F1-score of 37. 3%.

Relation Relation Extraction +4

UniSent: Universal Adaptable Sentiment Lexica for 1000+ Languages

no code implementations LREC 2020 Ehsaneddin Asgari, Fabienne Braune, Benjamin Roth, Christoph Ringlstetter, Mohammad R. K. Mofrad

We introduce a method called DomDrift to mitigate the huge domain mismatch between Bible and Twitter by a confidence weighting scheme that uses domain-specific embeddings to compare the nearest neighbors for a candidate sentiment word in the source (Bible) and target (Twitter) domain.

Sentiment Analysis Unsupervised Domain Adaptation

Domain adaptation for part-of-speech tagging of noisy user-generated text

no code implementations NAACL 2019 Luisa März, Dietrich Trautmann, Benjamin Roth

We propose an architecture that trains an out-of-domain model on a large newswire corpus, and transfers those weights by using them as a prior for a model trained on the target domain (a data-set of German Tweets) for which there is very little an-notations available.

Domain Adaptation Part-Of-Speech Tagging +3

Interpretable Question Answering on Knowledge Bases and Text

no code implementations ACL 2019 Alona Sydorova, Nina Poerner, Benjamin Roth

Our results suggest that IP provides better explanations than LIME or attention, according to both automatic and human evaluation.

Question Answering

Intent Recognition in Doctor-Patient Interviews

no code implementations LREC 2020 Robin Rojowiec, Benjamin Roth, Maximilian Fink

For some intent classes, the data only contains a few samples, and we apply Information Retrieval and Deep Learning methods that are robust with respect to small amounts of training data for recognizing the intent of an utterance and providing the correct response.

Information Retrieval Intent Recognition +1

Dirichlet-Smoothed Word Embeddings for Low-Resource Settings

no code implementations LREC 2020 Jakob Jungmaier, Nora Kassner, Benjamin Roth

We evaluate on standard word similarity data sets and compare to word2vec and the recent state of the art for low-resource settings: Positive and Unlabeled (PU) Learning for word embeddings.

Word Embeddings Word Similarity

Focused Contrastive Training for Test-based Constituency Analysis

no code implementations30 Sep 2021 Benjamin Roth, Erion Çano

We propose a scheme for self-training of grammaticality models for constituency analysis based on linguistic tests.

Language Modelling Position +1

Checking HateCheck: a cross-functional analysis of behaviour-aware learning for hate speech detection

1 code implementation nlppower (ACL) 2022 Pedro Henrique Luz de Araujo, Benjamin Roth

Behavioural testing -- verifying system capabilities by validating human-designed input-output pairs -- is an alternative evaluation method of natural language processing systems proposed to address the shortcomings of the standard approach: computing metrics on held-out data.

Hate Speech Detection

Is the Computation of Abstract Sameness Relations Human-Like in Neural Language Models?

no code implementations12 May 2022 Lukas Thoma, Benjamin Roth

In recent years, deep neural language models have made strong progress in various NLP tasks.

Language Acquisition

WeaNF”:" Weak Supervision with Normalizing Flows

no code implementations RepL4NLP (ACL) 2022 Andreas Stephan, Benjamin Roth

In this work, we explore a novel direction of generative modeling for weak supervision”:" Instead of modeling the output of the annotation process (the labeling function matches), we generatively model the input-side data distributions (the feature space) covered by labeling functions.

Topic Segmentation of Research Article Collections

no code implementations18 May 2022 Erion Çano, Benjamin Roth

In this work, we perform topic segmentation of a paper data collection that we crawled and produce a multitopic dataset of roughly seven million paper data records.

named-entity-recognition Named Entity Recognition +3

SepLL: Separating Latent Class Labels from Weak Supervision Noise

1 code implementation25 Oct 2022 Andreas Stephan, Vasiliki Kougia, Benjamin Roth

In this work, we provide a method for learning from weak labels by separating two types of complementary information associated with the labeling functions: information related to the target label and information specific to one labeling function only.

text-classification Text Classification +1

Cross-functional Analysis of Generalisation in Behavioural Learning

1 code implementation22 May 2023 Pedro Henrique Luz de Araujo, Benjamin Roth

In behavioural testing, system functionalities underrepresented in the standard evaluation setting (with a held-out test set) are validated through controlled input-output pairs.

Paraphrase Identification Reading Comprehension +1

Functionality learning through specification instructions

no code implementations14 Nov 2023 Pedro Henrique Luz de Araujo, Benjamin Roth

A core aspect of our analysis is to measure the effect that including a set of specifications has on a held-out set of unseen, qualitatively different specifications.

Fairness

Specification Overfitting in Artificial Intelligence

no code implementations13 Mar 2024 Benjamin Roth, Pedro Henrique Luz de Araujo, Yuxi Xia, Saskia Kaltenbrunner, Christoph Korab

Machine learning (ML) and artificial intelligence (AI) approaches are often criticized for their inherent bias and for their lack of control, accountability, and transparency.

Fairness

Cannot find the paper you are looking for? You can Submit a new open access paper.