Search Results for author: Thomas Gabor

Found 29 papers, 9 papers with code

Qandle: Accelerating State Vector Simulation Using Gate-Matrix Caching and Circuit Splitting

1 code implementation • 14 Apr 2024 • Gerhard Stenzel, Sebastian Zielinski, Michael Kölle, Philipp Altmann, Jonas Nüßlein, Thomas Gabor

To address the computational complexity associated with state-vector simulation for quantum circuits, we propose a combination of advanced techniques to accelerate circuit execution.

Paper
Code

REACT: Revealing Evolutionary Action Consequence Trajectories for Interpretable Reinforcement Learning

no code implementations • 4 Apr 2024 • Philipp Altmann, Céline Davignon, Maximilian Zorn, Fabian Ritz, Claudia Linnhoff-Popien, Thomas Gabor

To enhance the interpretability of Reinforcement Learning (RL), we propose Revealing Evolutionary Action Consequence Trajectories (REACT).

Descriptive reinforcement-learning +1

Paper
Add Code

Challenges for Reinforcement Learning in Quantum Circuit Design

1 code implementation • 18 Dec 2023 • Philipp Altmann, Jonas Stein, Michael Kölle, Adelina Bärligea, Thomas Gabor, Thomy Phan, Sebastian Feld, Claudia Linnhoff-Popien

Quantum computing (QC) in the current NISQ era is still limited in size and precision.

Quantum Machine Learning reinforcement-learning +1

Paper
Code

Towards Transfer Learning for Large-Scale Image Classification Using Annealing-based Quantum Boltzmann Machines

no code implementations • 27 Nov 2023 • Daniëlle Schuman, Leo Sünkel, Philipp Altmann, Jonas Stein, Christoph Roch, Thomas Gabor, Claudia Linnhoff-Popien

Quantum Transfer Learning (QTL) recently gained popularity as a hybrid quantum-classical approach for image classification tasks by efficiently combining the feature extraction capabilities of large Convolutional Neural Networks with the potential benefits of Quantum Machine Learning (QML).

Classification Computed Tomography (CT) +3

Paper
Add Code

DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training

no code implementations • 18 Jan 2023 • Philipp Altmann, Thomy Phan, Fabian Ritz, Thomas Gabor, Claudia Linnhoff-Popien

We propose discriminative reward co-training (DIRECT) as an extension to deep reinforcement learning algorithms.

Imitation Learning

Paper
Add Code

Constructing Organism Networks from Collaborative Self-Replicators

no code implementations • 20 Dec 2022 • Steffen Illium, Maximilian Zorn, Cristian Lenta, Michael Kölle, Claudia Linnhoff-Popien, Thomas Gabor

We introduce organism networks, which function like a single neural network but are composed of several neural particle networks; while each particle network fulfils the role of a single weight application within the organism network, it is also trained to self-replicate its own weights.

Paper
Add Code

Empirical Analysis of Limits for Memory Distance in Recurrent Neural Networks

no code implementations • 20 Dec 2022 • Steffen Illium, Thore Schillman, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien

Common to all different kinds of recurrent neural networks (RNNs) is the intention to model relations between data points through time.

Paper
Add Code

Capturing Dependencies within Machine Learning via a Formal Process Model

no code implementations • 10 Aug 2022 • Fabian Ritz, Thomy Phan, Andreas Sedlmeier, Philipp Altmann, Jan Wieghardt, Reiner Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien, Thomas Gabor

We define a comprehensive SD process model for ML that encompasses most tasks and artifacts described in the literature in a consistent way.

Paper
Add Code

Black Box Optimization Using QUBO and the Cross Entropy Method

1 code implementation • 24 Jun 2022 • Jonas Nüßlein, Christoph Roch, Thomas Gabor, Jonas Stein, Claudia Linnhoff-Popien, Sebastian Feld

A common approach to realising BBO is to learn a surrogate model which approximates the target black-box function which can then be solved via white-box optimization methods.

Paper
Code

Case-Based Inverse Reinforcement Learning Using Temporal Coherence

1 code implementation • 12 Jun 2022 • Jonas Nüßlein, Steffen Illium, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien

As a prior, we assume that the higher-level strategy is to reach an unknown target state area, which we hypothesize is a valid prior for many domains in Reinforcement Learning.

Imitation Learning reinforcement-learning +2

Paper
Code

VAST: Value Function Factorization with Variable Agent Sub-Teams

1 code implementation • NeurIPS 2021 • Thomy Phan, Fabian Ritz, Lenz Belzner, Philipp Altmann, Thomas Gabor, Claudia Linnhoff-Popien

We evaluate VAST in three multi-agent domains and show that VAST can significantly outperform state-of-the-art VFF, when the number of agents is sufficiently large.

Multi-agent Reinforcement Learning

Paper
Code

Solving Large Steiner Tree Problems in Graphs for Cost-Efficient Fiber-To-The-Home Network Expansion

no code implementations • 22 Sep 2021 • Tobias Müller, Kyrill Schmid, Daniëlle Schuman, Thomas Gabor, Markus Friedrich, Marc Geitz

The expansion of Fiber-To-The-Home (FTTH) networks creates high costs due to expensive excavation procedures.

Evolutionary Algorithms Steiner Tree Problem

Paper
Add Code

SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning

no code implementations • 14 Dec 2020 • Fabian Ritz, Thomy Phan, Robert Müller, Thomas Gabor, Andreas Sedlmeier, Marc Zeller, Jan Wieghardt, Reiner Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien

A characteristic of reinforcement learning is the ability to develop unforeseen strategies when solving problems.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

The Holy Grail of Quantum Artificial Intelligence: Major Challenges in Accelerating the Machine Learning Pipeline

no code implementations • 29 Apr 2020 • Thomas Gabor, Leo Sünkel, Fabian Ritz, Thomy Phan, Lenz Belzner, Christoph Roch, Sebastian Feld, Claudia Linnhoff-Popien

We discuss the synergetic connection between quantum computing and artificial intelligence.

BIG-bench Machine Learning

Paper
Add Code

Insights on Training Neural Networks for QUBO Tasks

no code implementations • 29 Apr 2020 • Thomas Gabor, Sebastian Feld, Hila Safi, Thomy Phan, Claudia Linnhoff-Popien

Current hardware limitations restrict the potential when solving quadratic unconstrained binary optimization (QUBO) problems via the quantum approximate optimization algorithm (QAOA) or quantum annealing (QA).

Traveling Salesman Problem

Paper
Add Code

Cross Entropy Hyperparameter Optimization for Constrained Problem Hamiltonians Applied to QAOA

2 code implementations • 11 Mar 2020 • Christoph Roch, Alexander Impertro, Thomy Phan, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien

Such algorithms are usually implemented in a variational form, combining a classical optimization method with a quantum machine to find good solutions to an optimization problem.

Quantum Physics

Paper
Code

Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning

no code implementations • 31 Dec 2019 • Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien

We further present a first viable solution for calculating a dynamic classification threshold, based on the uncertainty distribution of the training data.

Bayesian Inference Classification +4

Paper
Add Code

Integration and Evaluation of Quantum Accelerators for Data-Driven User Functions

no code implementations • 12 Dec 2019 • Thomas Hubregtsen, Christoph Segler, Josef Pichlmeier, Aritra Sarkar, Thomas Gabor, Koen Bertels

In this work we propose a system architecture for the integration of quantum accelerators.

Paper
Add Code

Benchmarking Surrogate-Assisted Genetic Recommender Systems

no code implementations • 8 Aug 2019 • Thomas Gabor, Philipp Altmann

The surrogate is used to recommend new items to the user, which are then evaluated according to the user's liking and subsequently removed from the search space.

Benchmarking Evolutionary Algorithms +1

Paper
Add Code

Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning

1 code implementation • 11 Jul 2019 • Thomy Phan, Thomas Gabor, Robert Müller, Christoph Roch, Claudia Linnhoff-Popien

We propose Stable Yet Memory Bounded Open-Loop (SYMBOL) planning, a general memory bounded approach to partially observable open-loop planning.

Thompson Sampling

Paper
Code

Emergent Escape-based Flocking Behavior using Multi-Agent Reinforcement Learning

no code implementations • 10 May 2019 • Carsten Hahn, Thomy Phan, Thomas Gabor, Lenz Belzner, Claudia Linnhoff-Popien

In nature, flocking or swarm behavior is observed in many species as it has beneficial properties like reducing the probability of being caught by a predator.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies

no code implementations • 25 Jan 2019 • Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien

We experimentally evaluate STEP in two challenging and stochastic domains with large state and joint action spaces and show that STEP is able to learn stronger policies than standard multi-agent reinforcement learning algorithms, when combining multi-agent open-loop planning with centralized function approximation.

Decision Making Multi-agent Reinforcement Learning

Paper
Add Code

Uncertainty-Based Out-of-Distribution Detection in Deep Reinforcement Learning

no code implementations • 8 Jan 2019 • Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien

Although prior work has shown that dropout-based variational inference techniques and bootstrap-based approaches can be used to model epistemic uncertainty, the suitability for detecting OOD samples in deep reinforcement learning remains an open question.

Bayesian Inference Open-Ended Question Answering +4

Paper
Add Code

Preparing for the Unexpected: Diversity Improves Planning Resilience in Evolutionary Algorithms

no code implementations • 30 Oct 2018 • Thomas Gabor, Lenz Belzner, Thomy Phan, Kyrill Schmid

As automatic optimization techniques find their way into industrial applications, the behavior of many complex systems is determined by some form of planner picking the right actions to optimize a given objective function.

Evolutionary Algorithms

Paper
Add Code

Inheritance-Based Diversity Measures for Explicit Convergence Control in Evolutionary Algorithms

no code implementations • 30 Oct 2018 • Thomas Gabor, Lenz Belzner, Claudia Linnhoff-Popien

Diversity is an important factor in evolutionary algorithms to prevent premature convergence towards a single local optimum.

Evolutionary Algorithms

Paper
Add Code

Genealogical Distance as a Diversity Estimate in Evolutionary Algorithms

no code implementations • 27 Apr 2017 • Thomas Gabor, Lenz Belzner

The evolutionary edit distance between two individuals in a population, i. e., the amount of applications of any genetic operator it would take the evolutionary process to generate one individual starting from the other, seems like a promising estimate for the diversity between said individuals.

Evolutionary Algorithms

Paper
Add Code

Bayesian Verification under Model Uncertainty

1 code implementation • 28 Feb 2017 • Lenz Belzner, Thomas Gabor

We propose such a definition of subjective satisfaction based on recently introduced satisfaction functions.

Paper
Code

Stacked Thompson Bandits

1 code implementation • 28 Feb 2017 • Lenz Belzner, Thomas Gabor

We introduce Stacked Thompson Bandits (STB) for efficiently generating plans that are likely to satisfy a given bounded temporal logic requirement.

Thompson Sampling

Paper
Code

QoS-Aware Multi-Armed Bandits

no code implementations • 28 Feb 2017 • Lenz Belzner, Thomas Gabor

Motivated by runtime verification of QoS requirements in self-adaptive and self-organizing systems that are able to reconfigure their structure and behavior in response to runtime data, we propose a QoS-aware variant of Thompson sampling for multi-armed bandits.

Decision Making Multi-Armed Bandits +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.