Safe Exploration
35 papers with code • 0 benchmarks • 0 datasets
Safe Exploration is an approach to collecting ground-truth data by interacting safely with the environment.
Source: Chance-Constrained Trajectory Optimization for Safe Exploration and Learning of Nonlinear Systems
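One common way to make this idea concrete is to filter an exploratory policy's proposed actions through a safety check before acting. The sketch below is a minimal illustration, not taken from any paper listed here; `env`, `is_safe`, and `fallback_action` are hypothetical placeholders for a domain-specific interface.

```python
def safe_exploration_step(env, state, propose_action, is_safe, fallback_action):
    """One step of exploration with a safety filter (illustrative sketch).

    propose_action:  exploratory policy, e.g. epsilon-greedy or random.
    is_safe:         domain-specific check that an action keeps the system
                     inside a known-safe set (hypothetical placeholder).
    fallback_action: conservative backup used when the proposal is unsafe.
    """
    action = propose_action(state)
    if not is_safe(state, action):       # reject unsafe proposals
        action = fallback_action(state)  # fall back to a safe default
    next_state, reward, done = env.step(action)  # hypothetical env interface
    return next_state, reward, done
```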
Latest papers
Effects of Safety State Augmentation on Safe Exploration
We further show that Simmer can stabilize training and improve the performance of safe RL with average constraints.
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning
Although using bounds as surrogate functions to design safe RL algorithms has appeared in some existing works, we develop them in at least three aspects: (i) we provide a rigorous theoretical analysis that extends the surrogate functions to the generalized advantage estimator (GAE).
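For reference, the generalized advantage estimator mentioned here is computed from temporal-difference residuals by a backward recursion. A minimal sketch, assuming a single trajectory with scalar rewards and value estimates; the function below is illustrative, not the CUP implementation:

```python
import numpy as np

def gae(rewards, values, gamma=0.99, lam=0.95):
    """Generalized advantage estimation over one trajectory.

    rewards: r_0..r_{T-1}; values: V(s_0)..V(s_T) (length T+1).
    Computes A_t = delta_t + gamma * lam * A_{t+1}, where
    delta_t = r_t + gamma * V(s_{t+1}) - V(s_t).
    """
    T = len(rewards)
    advantages = np.zeros(T)
    last = 0.0
    for t in reversed(range(T)):
        delta = rewards[t] + gamma * values[t + 1] - values[t]
        last = delta + gamma * lam * last
        advantages[t] = last
    return advantages
```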
GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems
Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage.
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Safe reinforcement learning is extremely challenging: not only must the agent explore an unknown environment, it must do so while ensuring no safety constraint violations.
Safe Policy Optimization with Local Generalized Linear Function Approximations
Safe exploration is key to applying reinforcement learning (RL) in safety-critical systems.
Infinite Time Horizon Safety of Bayesian Neural Networks
Bayesian neural networks (BNNs) place distributions over the weights of a neural network to model uncertainty in the data and the network's prediction.
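As a concrete picture of that definition, a hedged sketch: with a factorized Gaussian posterior over the weights, predictive uncertainty can be estimated by sampling weight vectors and measuring the spread of the resulting predictions. The one-layer network and posterior parameters below are toy placeholders, not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "posterior" over the weights of a one-layer network: independent
# Gaussians with (placeholder) learned means and standard deviations.
w_mean = np.array([0.8, -0.3])
w_std = np.array([0.1, 0.05])

def predict(x, n_samples=100):
    """Monte Carlo predictive distribution: sample weights, then predict."""
    ws = rng.normal(w_mean, w_std, size=(n_samples, 2))  # sample weight vectors
    preds = np.tanh(ws @ x)                              # forward pass per sample
    return preds.mean(), preds.std()                     # prediction and uncertainty

mean, std = predict(np.array([1.0, 2.0]))
```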
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Based on MetaDrive, we construct a variety of RL tasks and baselines in both single-agent and multi-agent settings, including benchmarking generalizability across unseen scenes, safe exploration, and learning multi-agent traffic.
Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety
The safety constraints commonly used by existing safe reinforcement learning (RL) methods are defined only in expectation over initial states, so individual states can still be unsafe, which is unsatisfactory for real-world safety-critical tasks.
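The distinction can be written out. With per-step cost c, discount factor gamma, and budget d (notation chosen here for illustration, not taken from the paper), the common constraint bounds an expectation over the initial-state distribution, while a statewise constraint must hold at every state:

```latex
% Expectation-based constraint, over initial states s_0 ~ rho_0:
\mathbb{E}_{s_0 \sim \rho_0}\Big[\sum_{t \ge 0} \gamma^t c(s_t, a_t)\Big] \le d
% Statewise constraint, required to hold for every state s:
\mathbb{E}\Big[\sum_{t \ge 0} \gamma^t c(s_t, a_t) \,\Big|\, s_0 = s\Big] \le d
\quad \text{for all } s
```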
Safe Continuous Control with Constrained Model-Based Policy Optimization
Further, we provide theoretical and empirical analyses of the implications of model usage for constrained policy optimization problems and introduce a practical algorithm that accelerates policy search with model-generated data.
Autonomous UAV Exploration of Dynamic Environments via Incremental Sampling and Probabilistic Roadmap
Autonomous exploration requires robots to generate informative trajectories iteratively.