Search Results for author: Matej Vecerik

Found 3 papers, 2 papers with code

Safe Exploration in Continuous Action Spaces

6 code implementations • 26 Jan 2018 • Gal Dalal, Krishnamurthy Dvijotham, Matej Vecerik, Todd Hester, Cosmin Paduraru, Yuval Tassa

We address the problem of deploying a reinforcement learning (RL) agent on a physical system such as a datacenter cooling unit or robot, where critical constraints must never be violated.

Reinforcement Learning (RL) Safe Exploration

517

Paper
Code

Deep Q-learning from Demonstrations

5 code implementations • 12 Apr 2017 • Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys

We present an algorithm, Deep Q-learning from Demonstrations (DQfD), that leverages small sets of demonstration data to massively accelerate the learning process even from relatively small amounts of demonstration data and is able to automatically assess the necessary ratio of demonstration data while learning thanks to a prioritized replay mechanism.

Imitation Learning Q-Learning +1

2,548

Paper
Code

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

no code implementations • ICLR 2018 • Ivaylo Popov, Nicolas Heess, Timothy Lillicrap, Roland Hafner, Gabriel Barth-Maron, Matej Vecerik, Thomas Lampe, Yuval Tassa, Tom Erez, Martin Riedmiller

Solving this difficult and practically relevant problem in the real world is an important long-term goal for the field of robotics.

Continuous Control Q-Learning +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.