Safe Exploration
35 papers with code • 0 benchmarks • 0 datasets
Safe Exploration is an approach to collecting ground-truth data by interacting with the environment without violating safety constraints.
Source: Chance-Constrained Trajectory Optimization for Safe Exploration and Learning of Nonlinear Systems
Benchmarks
These leaderboards are used to track progress in Safe Exploration
Libraries
Use these libraries to find Safe Exploration models and implementations
Most implemented papers
Safe Policy Optimization with Local Generalized Linear Function Approximations
Safe exploration is a key to applying reinforcement learning (RL) in safety-critical systems.
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Safe reinforcement learning is extremely challenging: not only must the agent explore an unknown environment, it must also do so while ensuring no safety constraint violations.
GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems
Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage.
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning
Although using bounds as surrogate functions to design safe RL algorithms has appeared in some existing works, we develop them in at least three aspects: (i) we provide a rigorous theoretical analysis extending the surrogate functions to the generalized advantage estimator (GAE).
Effects of Safety State Augmentation on Safe Exploration
We further show that Simmer can stabilize training and improve the performance of safe RL with average constraints.
Toward Safe and Accelerated Deep Reinforcement Learning for Next-Generation Wireless Networks
Nevertheless, several challenges hinder the practical adoption of DRL in commercial networks.
Safe Exploration Method for Reinforcement Learning under Existence of Disturbance
We define the safety during learning as satisfaction of the constraint conditions explicitly defined in terms of the state and propose a safe exploration method that uses partial prior knowledge of a controlled object and disturbance.
Near-Optimal Multi-Agent Learning for Safe Coverage Control
In this paper, we aim to efficiently learn the density to approximately solve the coverage problem while preserving the agents' safety.
Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm
We compare our approach with relevant model-free and model-based approaches in constrained RL using the challenging Safe Reinforcement Learning benchmark, the OpenAI Safety Gym.
Atlas: Automate Online Service Configuration in Network Slicing
First, we design a learning-based simulator to reduce the sim-to-real discrepancy, which is accomplished by a new parameter searching method based on Bayesian optimization.