1 code implementation • 24 Feb 2021 • Mingyu Cai, Mohammadhosein Hasanbeig, Shaoping Xiao, Alessandro Abate, Zhen Kan
This paper investigates the motion planning of autonomous dynamical systems modeled by Markov decision processes (MDPs) with unknown transition probabilities over continuous state and action spaces.
1 code implementation • 20 Jan 2021 • Mirco Giacobbe, Mohammadhosein Hasanbeig, Daniel Kroening, Hjalmar Wijk
We present the first exact method for analysing and ensuring the safety of DRL agents for Atari games.
no code implementations • 6 Jul 2020 • Thomas J. Ringstrom, Mohammadhosein Hasanbeig, Alessandro Abate
In Hierarchical Control, compositionality, abstraction, and task-transfer are crucial for designing versatile algorithms which can solve a variety of problems with maximal representational reuse.
no code implementations • 26 Feb 2020 • Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening
This paper presents the concept of an adaptive safe padding that forces Reinforcement Learning (RL) to synthesise optimal control policies while ensuring safety during the learning process.
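The shielding pattern behind safe padding can be illustrated with a minimal sketch (the helper names and the toy corridor task are ours, not the paper's): before executing the learner's proposed action, check it against a conservatively padded safe set and substitute a known-safe fallback otherwise.

```python
def shielded_step(state, proposed_action, is_safe, fallback_action):
    """Execute proposed_action only if it passes the safety check in
    `state`; otherwise substitute a known-safe fallback action."""
    if is_safe(state, proposed_action):
        return proposed_action
    return fallback_action(state)

# Toy example: a 1-D corridor over positions 0..10; actions are +1/-1 moves.
# The (conservatively padded) safe set keeps the agent away from both ends.
def is_safe(state, action):
    return 1 <= state + action <= 9

def fallback_action(state):
    return 1 if state < 5 else -1        # retreat toward the middle

allowed = shielded_step(5, 1, is_safe, fallback_action)    # passes the check
blocked = shielded_step(1, -1, is_safe, fallback_action)   # overridden to +1
```

The RL agent still learns from whichever action was actually executed, so exploration continues while the shield rules out unsafe moves during learning.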
1 code implementation • 22 Nov 2019 • Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate, Tom Melham, Daniel Kroening
This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) agents when the reward is sparse and non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives.
2 code implementations • 23 Sep 2019 • Lim Zun Yuan, Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening
We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal structure.
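As generic scaffolding (not the paper's framework; the toy task, hyper-parameters, and names are ours), a one-step actor-critic with a Gaussian policy on a continuous state-action problem looks like this: the critic estimates a value baseline, and the actor takes a policy-gradient step weighted by the resulting advantage.

```python
import random

# Minimal one-step actor-critic sketch on a toy continuous task:
# state x ~ U(-1, 1), action a in R, reward r = -(a - x)^2,
# so the optimal policy mean is a = x (i.e. theta -> 1).
random.seed(0)
theta = 0.0                    # actor: Gaussian policy mean = theta * x
w, b = 0.0, 0.0                # critic: linear value function V(x) = w*x + b
sigma = 0.5                    # fixed exploration noise
alpha_actor, alpha_critic = 0.05, 0.1

for _ in range(5000):
    x = random.uniform(-1.0, 1.0)
    a = random.gauss(theta * x, sigma)       # sample from the Gaussian policy
    r = -(a - x) ** 2
    advantage = r - (w * x + b)              # one-step TD error (episodic)
    # Critic: gradient step on the squared TD error
    w += alpha_critic * advantage * x
    b += alpha_critic * advantage
    # Actor: policy gradient, grad log pi = (a - theta*x) / sigma^2 * x
    theta += alpha_actor * advantage * (a - theta * x) / sigma ** 2 * x

# theta should approach 1 (the policy mean tracks the state)
```

The critic's baseline reduces the variance of the actor's gradient estimate; a sparse, temporally structured reward as in the paper would replace the dense toy reward above.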
1 code implementation • 11 Sep 2019 • Mohammadhosein Hasanbeig, Yiannis Kantaros, Alessandro Abate, Daniel Kroening, George J. Pappas, Insup Lee
Reinforcement Learning (RL) has emerged as a method of choice for solving complex sequential decision-making problems in automatic control, computer science, economics, and biology.
no code implementations • 20 Sep 2018 • Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening
We propose a method for efficient training of Q-functions for continuous-state Markov Decision Processes (MDPs) such that the traces of the resulting policies satisfy a given Linear Temporal Logic (LTL) property.
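The core product construction can be sketched in a few lines (an illustrative toy, not the paper's method: we use a two-state automaton for the simple property "eventually reach the goal", a discretised 1-D state space, and reward on automaton acceptance; all names are ours).

```python
import random

random.seed(1)

N, GOAL = 6, 5                  # discretised 1-D state space; goal cell
ACTIONS = [-1, +1]

def automaton_step(q, cell):
    """Two-state automaton for 'F goal': q0 --goal--> q1 (accepting, absorbing)."""
    return 1 if (q == 1 or cell == GOAL) else 0

Q = {}
def get_q(s, q, a):
    return Q.get((s, q, a), 0.0)

alpha, gamma = 0.5, 0.9
for _ in range(300):
    s, q = 0, 0                               # product state: (MDP cell, automaton state)
    for _ in range(60):
        a = random.choice(ACTIONS)            # uniform exploration (off-policy)
        s2 = min(max(s + a, 0), N - 1)
        q2 = automaton_step(q, s2)
        r = 1.0 if (q2 == 1 and q == 0) else 0.0   # reward only on acceptance
        best = max(get_q(s2, q2, u) for u in ACTIONS)
        Q[(s, q, a)] = get_q(s, q, a) + alpha * (r + gamma * best - get_q(s, q, a))
        s, q = s2, q2
        if q == 1:                            # property satisfied; end episode
            break

# The greedy policy in the product space moves right, toward the goal.
```

A full LTL property would compile to a larger automaton (e.g. a limit-deterministic Büchi automaton), but the pattern is the same: learn Q-values over the on-the-fly product of the MDP state and the automaton state, with reward tied to the automaton's accepting condition.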
no code implementations • 7 Feb 2018 • Mohammadhosein Hasanbeig, Lacra Pavel
The main focus of this paper is on enhancement of two types of game-theoretic learning algorithms: log-linear learning and reinforcement learning.
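Log-linear learning itself is easy to state: at each step one randomly chosen player revises its action, sampling from a Boltzmann distribution over its own actions given the others' current play. A minimal sketch on a 2x2 coordination game (the game, temperature, and variable names are ours, purely for illustration):

```python
import math
import random

random.seed(2)

# Symmetric 2x2 coordination game: both players prefer to match,
# and (1, 1) is the payoff-dominant equilibrium.
U = {(0, 0): 1.0, (1, 1): 2.0, (0, 1): 0.0, (1, 0): 0.0}
tau = 0.5                  # temperature; smaller tau means sharper best responses
actions = [0, 0]
time_at_11 = 0

for t in range(4000):
    i = random.randrange(2)                    # player chosen to revise
    other = actions[1 - i]
    # Log-linear (Boltzmann) revision over the revising player's own actions
    weights = [math.exp(U[(a, other)] / tau) for a in (0, 1)]
    actions[i] = 0 if random.random() * sum(weights) < weights[0] else 1
    if t >= 2000 and actions == [1, 1]:
        time_at_11 += 1

# In the long run the process concentrates on the potential maximiser (1, 1).
```

In potential games the stationary distribution of this process is a Gibbs distribution over the potential, so as the temperature shrinks, play concentrates on potential-maximising equilibria; here that is the payoff-dominant profile (1, 1).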
1 code implementation • 24 Jan 2018 • Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening
With this reward function, the policy synthesis procedure is "constrained" by the given specification.