Search Results for author: Ruediger Ehlers

Found 3 papers, 2 papers with code

Safe Reinforcement Learning via Shielding

1 code implementation29 Aug 2017 Mohammed Alshiekh, Roderick Bloem, Ruediger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu

In the first one, the shield acts each time the learning agent is about to make a decision and provides a list of safe actions.

reinforcement-learning Reinforcement Learning (RL) +1

Formal Verification of Piece-Wise Linear Feed-Forward Neural Networks

1 code implementation3 May 2017 Ruediger Ehlers

We present a specialized verification algorithm that employs this approximation in a search process in which it infers additional node phases for the non-linear nodes in the network from partial node phase assignments, similar to unit propagation in classical SAT solving.

Collision Avoidance Handwritten Digit Recognition

Correct-by-synthesis reinforcement learning with temporal logic constraints

no code implementations5 Mar 2015 Min Wen, Ruediger Ehlers, Ufuk Topcu

We establish both correctness (with respect to the temporal logic specifications) and optimality (with respect to the a priori unknown performance criterion) of this two-step technique for a fragment of temporal logic specifications.

Motion Planning Q-Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.