Search Results for author: Mohammadhosein Hasanbeig

Found 10 papers, 6 papers with code

Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic

1 code implementation24 Feb 2021 Mingyu Cai, Mohammadhosein Hasanbeig, Shaoping Xiao, Alessandro Abate, Zhen Kan

This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDP) with unknown transition probabilities over continuous state and action spaces.

Motion Planning OpenAI Gym +2

Shielding Atari Games with Bounded Prescience

1 code implementation20 Jan 2021 Mirco Giacobbe, Mohammadhosein Hasanbeig, Daniel Kroening, Hjalmar Wijk

We present the first exact method for analysing and ensuring the safety of DRL agents for Atari games.

Atari Games Autonomous Driving

Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot Transfer

no code implementations6 Jul 2020 Thomas J. Ringstrom, Mohammadhosein Hasanbeig, Alessandro Abate

In Hierarchical Control, compositionality, abstraction, and task-transfer are crucial for designing versatile algorithms which can solve a variety of problems with maximal representational reuse.

Cautious Reinforcement Learning with Logical Constraints

no code implementations26 Feb 2020 Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

This paper presents the concept of an adaptive safe padding that forces Reinforcement Learning (RL) to synthesise optimal control policies while ensuring safety during the learning process.

reinforcement-learning Reinforcement Learning (RL)

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

1 code implementation22 Nov 2019 Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate, Tom Melham, Daniel Kroening

This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) agents when the reward is sparse and non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives.

Hierarchical Reinforcement Learning Montezuma's Revenge +4

Modular Deep Reinforcement Learning with Temporal Logic Specifications

2 code implementations23 Sep 2019 Lim Zun Yuan, Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal structure.

reinforcement-learning Reinforcement Learning (RL)

Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees

1 code implementation11 Sep 2019 Mohammadhosein Hasanbeig, Yiannis Kantaros, Alessandro Abate, Daniel Kroening, George J. Pappas, Insup Lee

Reinforcement Learning (RL) has emerged as an efficient method of choice for solving complex sequential decision making problems in automatic control, computer science, economics, and biology.

Decision Making Decision Making Under Uncertainty +4

Logically-Constrained Neural Fitted Q-Iteration

no code implementations20 Sep 2018 Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

We propose a method for efficient training of Q-functions for continuous-state Markov Decision Processes (MDPs) such that the traces of the resulting policies satisfy a given Linear Temporal Logic (LTL) property.

From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning

no code implementations7 Feb 2018 Mohammadhosein Hasanbeig, Lacra Pavel

The main focus of this paper is on enhancement of two types of game-theoretic learning algorithms: log-linear learning and reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Logically-Constrained Reinforcement Learning

1 code implementation24 Jan 2018 Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

With this reward function, the policy synthesis procedure is "constrained" by the given specification.

Decision Making Decision Making Under Uncertainty +4

Cannot find the paper you are looking for? You can Submit a new open access paper.