no code implementations • 19 Feb 2021 • Derya Aksaray, Yasin Yazicioglu, Ahmet Semi Asarkaya
We propose a novel constrained reinforcement learning method for finding optimal policies in Markov Decision Processes while satisfying temporal logic constraints with a desired probability throughout the learning process.
Robotics Systems and Control Systems and Control