Search Results for author: Aivar Sootla

Found 12 papers, 3 papers with code

Effects of Safety State Augmentation on Safe Exploration

1 code implementation6 Jun 2022 Aivar Sootla, Alexander I. Cowen-Rivers, Jun Wang, Haitham Bou Ammar

We further show that Simmer can stabilize training and improve the performance of safe RL with average constraints.

Reinforcement Learning (RL) Safe Exploration +1

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

no code implementations31 May 2022 David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang

In this paper, we introduce a reinforcement learning (RL) framework named \textbf{L}earnable \textbf{I}mpulse \textbf{C}ontrol \textbf{R}einforcement \textbf{A}lgorithm (LICRA), for learning to optimally select both when to act and which actions to take when actions incur costs.

Reinforcement Learning (RL)

SEREN: Knowing When to Explore and When to Exploit

no code implementations30 May 2022 Changmin Yu, David Mguni, Dong Li, Aivar Sootla, Jun Wang, Neil Burgess

Efficient reinforcement learning (RL) involves a trade-off between "exploitative" actions that maximise expected reward and "explorative'" ones that sample unvisited states.

Reinforcement Learning (RL)

Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

1 code implementation14 Feb 2022 Aivar Sootla, Alexander I. Cowen-Rivers, Taher Jafferjee, Ziyan Wang, David Mguni, Jun Wang, Haitham Bou-Ammar

Satisfying safety constraints almost surely (or with probability one) can be critical for the deployment of Reinforcement Learning (RL) in real-life applications.

reinforcement-learning Reinforcement Learning (RL) +1

Reinforcement Learning in Presence of Discrete Markovian Context Evolution

no code implementations ICLR 2022 Hang Ren, Aivar Sootla, Taher Jafferjee, Junxiao Shen, Jun Wang, Haitham Bou-Ammar

We consider a context-dependent Reinforcement Learning (RL) setting, which is characterized by: a) an unknown finite number of not directly observable contexts; b) abrupt (discontinuous) context changes occurring during an episode; and c) Markovian context evolution.

reinforcement-learning Reinforcement Learning (RL) +1

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

no code implementations27 Oct 2021 David Mguni, Usman Islam, Yaqi Sun, Xiuling Zhang, Joel Jennings, Aivar Sootla, Changmin Yu, Ziyan Wang, Jun Wang, Yaodong Yang

In this paper, we introduce a new generation of RL solvers that learn to minimise safety violations while maximising the task reward to the extent that can be tolerated by the safe policy.

OpenAI Gym reinforcement-learning +3

Viscos Flows: Variational Schur Conditional Sampling With Normalizing Flows

no code implementations6 Jul 2021 Vincent Moens, Aivar Sootla, Haitham Bou Ammar, Jun Wang

We present a method for conditional sampling for pre-trained normalizing flows when only part of an observation is available.

SAMBA: Safe Model-Based & Active Reinforcement Learning

1 code implementation12 Jun 2020 Alexander I. Cowen-Rivers, Daniel Palenicek, Vincent Moens, Mohammed Abdullah, Aivar Sootla, Jun Wang, Haitham Ammar

In this paper, we propose SAMBA, a novel framework for safe reinforcement learning that combines aspects from probabilistic modelling, information theory, and statistics.

Reinforcement Learning (RL) Safe Reinforcement Learning

Distributed Reconstruction of Nonlinear Networks: An ADMM Approach

no code implementations28 Mar 2014 Wei Pan, Aivar Sootla, Guy-Bart Stan

In this paper, we present a distributed algorithm for the reconstruction of large-scale nonlinear networks.

Time Series Time Series Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.