Search Results for author: Dominik Baumann

Found 18 papers, 10 papers with code

Safe reinforcement learning in uncertain contexts

1 code implementation11 Jan 2024 Dominik Baumann, Thomas B. Schön

In this work, we drop this assumption and show how we can perform safe learning when we cannot directly measure the context variables.

Multi-class Classification reinforcement-learning +1

A computationally lightweight safe learning algorithm

no code implementations7 Sep 2023 Dominik Baumann, Krzysztof Kowalczyk, Koen Tiels, Paweł Wachel

Unfortunately, Gaussian process inference scales cubically with the number of data points, limiting applicability to high-dimensional and embedded systems.

Gaussian Processes

On the trade-off between event-based and periodic state estimation under bandwidth constraints

no code implementations2 Apr 2023 Dominik Baumann, Thomas B. Schön

In this article, we discuss, for a specific example, when the additional complexity of event-based methods is beneficial.

Scheduling

Towards remote fault detection by analyzing communication priorities

no code implementations30 Sep 2022 Alexander Gräfe, Dominik Baumann, Sebastian Trimpe

We propose a fault detection method that uses these priorities to detect errors in other agents.

Fault Detection

GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

1 code implementation24 Jan 2022 Bhavya Sukhija, Matteo Turchetta, David Lindner, Andreas Krause, Sebastian Trimpe, Dominik Baumann

Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage.

Safe Exploration

GoSafe: Globally Optimal Safe Robot Learning

1 code implementation27 May 2021 Dominik Baumann, Alonso Marco, Matteo Turchetta, Sebastian Trimpe

When learning policies for robotic systems from data, safety is a major concern, as violation of safety constraints may cause hardware damage.

Bayesian Optimization

Scaling Beyond Bandwidth Limitations: Wireless Control With Stability Guarantees Under Overload

no code implementations16 Apr 2021 Fabian Mager, Dominik Baumann, Carsten Herrmann, Sebastian Trimpe, Marco Zimmerling

An important class of cyber-physical systems relies on multiple agents that jointly perform a task by coordinating their actions over a wireless network.

Self-Driving Cars

Robot Learning with Crash Constraints

1 code implementation16 Oct 2020 Alonso Marco, Dominik Baumann, Majid Khadiv, Philipp Hennig, Ludovic Righetti, Sebastian Trimpe

We consider failing behaviors as those that violate a constraint and address the problem of learning with crash constraints, where no data is obtained upon constraint violation.

Bayesian Optimization

A Kernel Two-sample Test for Dynamical Systems

no code implementations23 Apr 2020 Friedrich Solowjow, Dominik Baumann, Christian Fiedler, Andreas Jocham, Thomas Seel, Sebastian Trimpe

Evaluating whether data streams are drawn from the same distribution is at the heart of various machine learning problems.

Anomaly Detection Feature Engineering +1

Classified Regression for Bayesian Optimization: Robot Learning with Unknown Penalties

no code implementations24 Jul 2019 Alonso Marco, Dominik Baumann, Philipp Hennig, Sebastian Trimpe

Learning robot controllers by minimizing a black-box objective cost using Bayesian optimization (BO) can be time-consuming and challenging.

Bayesian Optimization regression

Deep Reinforcement Learning for Event-Triggered Control

1 code implementation13 Sep 2018 Dominik Baumann, Jia-Jie Zhu, Georg Martius, Sebastian Trimpe

Event-triggered control (ETC) methods can achieve high-performance control with a significantly lower number of samples compared to usual, time-triggered methods.

reinforcement-learning Reinforcement Learning (RL)

Event-triggered Learning for Resource-efficient Networked Control

no code implementations5 Mar 2018 Friedrich Solowjow, Dominik Baumann, Jochen Garcke, Sebastian Trimpe

Common event-triggered state estimation (ETSE) algorithms save communication in networked control systems by predicting agents' behavior, and transmitting updates only when the predictions deviate significantly.

Cannot find the paper you are looking for? You can Submit a new open access paper.