no code implementations • 14 Dec 2023 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak
Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback).
no code implementations • 7 Nov 2023 • Akshay Dhonthi, Marcello Eiermann, Ernst Moritz Hahn, Vahid Hashemi
One prominent application is image recognition in autonomous driving, where the correct classification of objects, such as traffic signs, is essential for safe driving.
no code implementations • 14 Aug 2023 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak
Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success.
no code implementations • 14 Dec 2022 • Akshay Dhonthi, Ernst Moritz Hahn, Vahid Hashemi
Deep Neural Networks (DNN) are becoming increasingly more important in assisted and automated driving.
no code implementations • 23 Jun 2022 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak
Recursion is the fundamental paradigm to finitely describe potentially infinite objects.
no code implementations • 6 May 2022 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak
The surprising answer is that we have to pay significantly less when we instead expand the good-for-MDP property to alternating automata: like the nondeterministic GFM automata obtained from deterministic Rabin automata, the alternating good-for-MDP automata we produce from deterministic Streett automata are bi-linear in the the size of the deterministic automaton and its index, and can therefore be exponentially more succinct than minimal nondeterministic B\"uchi automata.
no code implementations • 16 Jun 2021 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak
Reinforcement learning synthesizes controllers without prior knowledge of the system.
no code implementations • 12 Jun 2021 • Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak
We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs).