continuous-control

424 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find continuous-control models and implementations

Most implemented papers

Continuous control with deep reinforcement learning

ray-project/ray 9 Sep 2015

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain.

Simple random search provides a competitive approach to reinforcement learning

modestyachts/ARS 19 Mar 2018

A common belief in model-free reinforcement learning is that methods based on random search in the parameter space of policies exhibit significantly worse sample complexity than those that explore the space of actions.

Conservative Q-Learning for Offline Reinforcement Learning

aviralkumar2907/CQL NeurIPS 2020

We theoretically show that CQL produces a lower bound on the value of the current policy and that it can be incorporated into a policy learning procedure with theoretical improvement guarantees.

High-Dimensional Continuous Control Using Generalized Advantage Estimation

labmlai/annotated_deep_learning_paper_implementations 8 Jun 2015

Policy gradient methods are an appealing approach in reinforcement learning because they directly optimize the cumulative reward and can straightforwardly be used with nonlinear function approximators such as neural networks.

Benchmarking Deep Reinforcement Learning for Continuous Control

rllab/rllab 22 Apr 2016

Recently, researchers have made significant progress combining the advances in deep learning for learning feature representations with reinforcement learning.

Parameter Space Noise for Exploration

tensorflow/models ICLR 2018

Combining parameter noise with traditional RL methods allows to combine the best of both worlds.

Off-Policy Deep Reinforcement Learning without Exploration

sfujim/BCQ 7 Dec 2018

Many practical applications of reinforcement learning constrain agents to learn from a fixed batch of data which has already been gathered, without offering further possibility for data collection.

Learning Latent Dynamics for Planning from Pixels

google-research/planet 12 Nov 2018

Planning has been very successful for control tasks with known environment dynamics.