Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

A platform for Applied Reinforcement Learning (Applied RL)

PDF Abstract ICML 2018 PDF ICML 2018 Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Continuous Control Lunar Lander (OpenAI Gym) SAC Score 284.59±0.97 # 1

Methods