Search Results for author: Michael Färber

Found 54 papers, 29 papers with code

RAGentA: Multi-Agent Retrieval-Augmented Generation for Attributed Question Answering

no code implementations • 20 Jun 2025 • Ines Besrour, Jingbo He, Tobias Schreieder, Michael Färber

Central to the framework is a hybrid retrieval strategy that combines sparse and dense methods, improving Recall@20 by 12.5% compared to the best single retrieval model, resulting in more correct and well-supported answers.

Paths to Causality: Finding Informative Subgraphs Within Knowledge Graphs for Knowledge-Based Causal Discovery

1 code implementation • 10 Jun 2025 • Yuni Susanti, Michael Färber

Inferring causal relationships between variable pairs is crucial for understanding multivariate interactions in complex systems.

Causal Discovery Causal Inference +2

SimplifyMyText: An LLM-Based System for Inclusive Plain Language Text Simplification

no code implementations • 19 Apr 2025 • Michael Färber, Parisa Aghdam, Kyuri Im, Mario Tawfelis, Hardik Ghoshal

Text simplification is essential for making complex content accessible to diverse audiences who face comprehension challenges.

Text Simplification

Can LLMs Leverage Observational Data? Towards Data-Driven Causal Discovery with LLMs

no code implementations • 15 Apr 2025 • Yuni Susanti, Michael Färber

Specifically, we examine whether LLMs can effectively utilize observational data through two prompting strategies: pairwise prompting and breadth first search (BFS)-based prompting.

Causal Discovery

Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence

no code implementations • 1 Mar 2025 • Zhan Qu, Shuzhou Yuan, Michael Färber, Marius Brennfleck, Niklas Wartha, Anton Stephan

Wake vortices - strong, coherent air turbulences created by aircraft - pose a significant risk to aviation safety and therefore require accurate and reliable detection methods.

Clustering Decision Making +2

Revisiting Projection-based Data Transfer for Cross-Lingual Named Entity Recognition in Low-Resource Languages

1 code implementation • 30 Jan 2025 • Andrei Politov, Oleh Shkalikov, René Jäkel, Michael Färber

These findings highlight the robustness of projection-based data transfer as an alternative to model-based methods for cross-lingual named entity recognition in low-resource languages.

Cross-Lingual NER named-entity-recognition +3

Hallucinations Can Improve Large Language Models in Drug Discovery

no code implementations • 23 Jan 2025 • Shuzhou Yuan, Michael Färber

In this paper, we come up with the hypothesis that hallucinations can improve LLMs in drug discovery.

Drug Discovery Hallucination

Incorporating Quantum Advantage in Quantum Circuit Generation through Genetic Programming

1 code implementation • 16 Jan 2025 • Christoph Stein, Michael Färber

Designing efficient quantum circuits that leverage quantum advantage compared to classical computing has become increasingly critical.

Quantum Circuit Generation

Graph-Guided Textual Explanation Generation Framework

no code implementations • 16 Dec 2024 • Shuzhou Yuan, Jingyi Sun, Ran Zhang, Michael Färber, Steffen Eger, Pepa Atanasova, Isabelle Augenstein

Specifically, highlight explanations are extracted as highly faithful cues representing the model's reasoning and are subsequently encoded through a graph neural network layer, which explicitly guides the NLE generation process.

Explanation Generation Graph Neural Network

The Effects of Hallucinations in Synthetic Training Data for Relation Extraction

no code implementations • 10 Oct 2024 • Steven Rogulsky, Nicholas Popovic, Michael Färber

However, this approach often introduces hallucinations, such as spurious facts, whose impact on relation extraction remains underexplored.

Data Augmentation Knowledge Graphs +3

Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery

1 code implementation • 26 Jul 2024 • Yuni Susanti, Michael Färber

In this paper, we investigate the capabilities of Small Language Models (SLMs, defined as LLMs with fewer than 1 billion parameters) with prompt-based learning for knowledge-based causal discovery.

Causal Discovery Knowledge Graphs

AutoRDF2GML: Facilitating RDF Integration in Graph Machine Learning

1 code implementation • 26 Jul 2024 • Michael Färber, David Lamprecht, Yuni Susanti

In this paper, we introduce AutoRDF2GML, a framework designed to convert RDF data into data representations tailored for graph machine learning tasks.

Graph Classification Knowledge Graphs +2

ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering

1 code implementation • 7 Jun 2024 • Raphael Gruber, Abdelrahman Abdallah, Michael Färber, Adam Jatowt

We introduce ComplexTempQA, a large-scale dataset consisting of over 100 million question-answer pairs designed to tackle the challenges in temporal question answering.

Information Retrieval Question Answering

GreeDy and CoDy: Counterfactual Explainers for Dynamic Graphs

no code implementations • 25 Mar 2024 • Zhan Qu, Daniel Gomm, Michael Färber

Temporal Graph Neural Networks (TGNNs), crucial for modeling dynamic graphs with time-varying interactions, face a significant challenge in explainability due to their complex model structure.

counterfactual Counterfactual Explanation +1

Embedded Named Entity Recognition using Probing Classifiers

2 code implementations • 18 Mar 2024 • Nicholas Popovič, Michael Färber

Streaming text generation has become a common way of increasing the responsiveness of language model powered applications, such as chat assistants.

Decoder Fact Checking +10

Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models

no code implementations • 28 Feb 2024 • Ercong Nie, Shuzhou Yuan, Bolei Ma, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze

Despite the predominance of English in their training data, English-centric Large Language Models (LLMs) like GPT-3 and LLaMA display a remarkable ability to perform multilingual tasks, raising questions about the depth and nature of their cross-lingual capabilities.

Part-Of-Speech Tagging Sentence

Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers

no code implementations • 18 Feb 2024 • Shuzhou Yuan, Ercong Nie, Bolei Ma, Michael Färber

Large Language Models (LLMs) possess outstanding capabilities in addressing various natural language processing (NLP) tasks.

text-classification Text Classification

ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks

1 code implementation • 29 Jan 2024 • Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze

However, most previous studies primarily focused on sentence-level classification tasks, and only a few considered token-level labeling tasks such as Named Entity Recognition (NER) and Part-of-Speech (POS) tagging.

Benchmarking In-Context Learning +8

Analyzing the Impact of Companies on AI Research Based on Publications

1 code implementation • 31 Oct 2023 • Michael Färber, Lazaros Tampakis

Artificial Intelligence (AI) is one of the most momentous technologies of our time.

Linked Papers With Code: The Latest in Machine Learning as an RDF Knowledge Graph

1 code implementation • 31 Oct 2023 • Michael Färber, David Lamprecht

In this paper, we introduce Linked Papers With Code (LPWC), an RDF knowledge graph that provides comprehensive, current information about almost 400,000 machine learning publications.

Knowledge Graph Embeddings

A Full-fledged Commit Message Quality Checker Based on Machine Learning

1 code implementation • 9 Sep 2023 • David Faragó, Michael Färber, Christian Petrov

By considering all rules from the most popular CM quality guideline, creating datasets for those rules, and training and evaluating state-of-the-art machine learning models to check those rules, we can answer the research question with: sufficiently well for practice, with the lowest F$_1$ score of 82.9\%, for the most challenging task.

Vocab-Expander: A System for Creating Domain-Specific Vocabularies Based on Word Embeddings

no code implementations • 7 Aug 2023 • Michael Färber, Nicholas Popovic

In this paper, we propose Vocab-Expander at https://vocab-expander.com, an online tool that enables end-users (e.g., technology scouts) to create and expand a vocabulary of their domain of interest.

Common Sense Reasoning Information Retrieval +3

Measuring Variety, Balance, and Disparity: An Analysis of Media Coverage of the 2021 German Federal Election

no code implementations • 7 Aug 2023 • Michael Färber, Jannik Schwade, Adam Jatowt

Determining and measuring diversity in news articles is important for a number of reasons, including preventing filter bubbles and fueling public discourse, especially before elections.

Articles Diversity

SemOpenAlex: The Scientific Landscape in 26 Billion RDF Triples

no code implementations • 7 Aug 2023 • Michael Färber, David Lamprecht, Johan Krause, Linn Aung, Peter Haase

We present SemOpenAlex, an extensive RDF knowledge graph that contains over 26 billion triples about scientific publications and their associated entities, such as authors, institutions, journals, and concepts.

Recommendation Systems

Evaluating Generative Models for Graph-to-Text Generation

1 code implementation • 27 Jul 2023 • Shuzhou Yuan, Michael Färber

Large language models (LLMs) have been widely employed for graph-to-text generation tasks.

Descriptive Text Generation

CoCon: A Data Set on Combined Contextualized Research Artifact Use

1 code implementation • 27 Mar 2023 • Tarek Saier, Youxiang Dong, Michael Färber

To enable more holistic analyses and systems dealing with academic publications and their content, we propose CoCon, a large scholarly data set reflecting the combined use of research artifacts, contextualized in academic publications' full-text.

Link Prediction Prediction

unarXive 2022: All arXiv Publications Pre-Processed for NLP, Including Structured Full-Text and Citation Network

1 code implementation • 27 Mar 2023 • Tarek Saier, Johan Krause, Michael Färber

Large-scale data sets on scholarly publications are the basis for a variety of bibliometric analyses and natural language processing (NLP) applications.

All Citation Recommendation

Biases in Scholarly Recommender Systems: Impact, Prevalence, and Mitigation

no code implementations • 18 Jan 2023 • Michael Färber, Melissa Coutinho, Shuzhou Yuan

With the remarkable increase in the number of scientific entities such as publications, researchers, and scientific topics, and the associated information overload in science, academic recommender systems have become increasingly important for millions of researchers and science enthusiasts.

Recommendation Systems

Predicting Companies' ESG Ratings from News Articles Using Multivariate Timeseries Analysis

no code implementations • 13 Nov 2022 • Tanja Aue, Adam Jatowt, Michael Färber

Environmental, social and governance (ESG) engagement of companies moved into the focus of public attention over recent years.

Articles

Are Investors Biased Against Women? Analyzing How Gender Affects Startup Funding in Europe

no code implementations • 1 Dec 2021 • Michael Färber, Alexander Klein

For startup founders, it is therefore crucial to know whether investors have a bias against women as startup founders and in which way startups face disadvantages due to gender bias.

Towards Full-Fledged Argument Search: A Framework for Extracting and Clustering Arguments from Unstructured Text

1 code implementation • 30 Nov 2021 • Michael Färber, Anna Steyer

We suggest (1) to combine the keyword search with precomputed topic clusters for argument-query matching, (2) to apply a novel approach based on sentence-level sequence-labeling for argument identification, and (3) to present aggregated arguments to users based on topic-aware argument clustering.

Clustering Sentence

Explaining Convolutional Neural Networks by Tagging Filters

no code implementations • 20 Sep 2021 • Anna Nguyen, Daniel Hagenmayer, Tobias Weller, Michael Färber

Finally, we show that the tags are helpful in analyzing classification errors caused by noisy input images and that the tags can be further processed by machines.

Classification image-classification +1

Safe, Fast, Concurrent Proof Checking for the lambda-Pi Calculus Modulo Rewriting

no code implementations • 17 Feb 2021 • Michael Färber

Several proof assistants, such as Isabelle or Coq, can concurrently check multiple proofs.

Logic in Computer Science

Right for the Right Reason: Making Image Classification Robust

no code implementations • 23 Jul 2020 • Anna Nguyen, Adrian Oberföll, Michael Färber

To this end, we propose a new explanation quality metric to measure object-aligned explanation in image classification, which we refer to as the ObAlEx metric.

Classification General Classification +5

Semantic Modelling of Citation Contexts for Context-aware Citation Recommendation

1 code implementation • ECIR 2020 • Tarek Saier, Michael Färber

New research is being published at a rate at which it is infeasible for many scholars to read and assess everything possibly relevant to their work.

Citation Recommendation

unarXive: A Large Scholarly Data Set with Publications' Full-Text, Annotated In-Text Citations, and Links to Metadata

1 code implementation • Scientometrics 2020 • Tarek Saier, Michael Färber

The data set, which is made freely available for research purposes, not only can enhance the future evaluation of research paper-based and citation context-based approaches, but also serve as a basis for new ways to analyze in-text citations, as we show prototypically in this article.

Citation Recommendation Document Summarization +3

Citation Recommendation: Approaches and Datasets

1 code implementation • 17 Feb 2020 • Michael Färber, Adam Jatowt

In recent years, several approaches and evaluation data sets have been presented.

Articles Citation Recommendation +1

HybridCite: A Hybrid Model for Context-Aware Citation Recommendation

3 code implementations • 15 Feb 2020 • Michael Färber, Ashwath Sampath

The process of recommending citations for citation contexts is called local citation recommendation and is the focus of this paper.

Citation Recommendation Information Retrieval +2

Making Neural Networks FAIR

1 code implementation • 26 Jul 2019 • Anna Nguyen, Tobias Weller, Michael Färber, York Sure-Vetter

In this paper, we first present the neural network ontology FAIRnets Ontology, an ontology to make existing neural network models findable, accessible, interoperable, and reusable according to the FAIR principles.

Linked Crunchbase: A Linked Data API and RDF Data Set About Innovative Companies

1 code implementation • 19 Jul 2019 • Michael Färber

Crunchbase is an online platform collecting information about startups and technology companies, including attributes and relations of companies, people, and investments.

Which Knowledge Graph Is Best for Me?

no code implementations • 28 Sep 2018 • Michael Färber, Achim Rettinger

Furthermore, we proposed a framework for finding the most suitable knowledge graph for a given setting.

Knowledge Graphs Survey

Monte Carlo Tableau Proof Search

no code implementations • 18 Nov 2016 • Michael Färber, Cezary Kaliszyk, Josef Urban

We study Monte Carlo Tree Search to guide proof search in tableau calculi.

Automated Theorem Proving

Internal Guidance for Satallax

no code implementations • 30 May 2016 • Michael Färber, Chad Brown

We evaluated our method on a simply-typed higher-order logic version of the Flyspeck project, where it solves 26% more problems than Satallax without internal guidance.

General Classification
