no code implementations • 1 Dec 2022 • Nadia M. Ady, Roshan Shariff, Johannes Günther, Patrick M. Pilarski
As a second main contribution of this work, we show how these properties may be implemented together in a proof-of-concept reinforcement learning agent: we demonstrate how the properties manifest in the behaviour of this agent in a simple non-episodic grid-world environment that includes curiosity-inducing locations and induced targets of curiosity.
no code implementations • 13 Jun 2022 • Alexandra Kearney, Anna Koop, Johannes Günther, Patrick M. Pilarski
In computational reinforcement learning, a growing body of work seeks to construct an agent's perception of the world through predictions of future sensations; predictions about environment observations are used as additional input features to enable better goal-directed decision-making.
no code implementations • 20 May 2022 • Nadia M. Ady, Roshan Shariff, Johannes Günther, Patrick M. Pilarski
Curiosity for machine agents has been a focus of intense research.
no code implementations • 18 Nov 2021 • Alex Kearney, Anna Koop, Johannes Günther, Patrick M. Pilarski
In computational reinforcement learning, a growing body of work seeks to express an agent's model of the world through predictions about future sensations.
no code implementations • 27 Oct 2020 • Daniel Graves, Johannes Günther, Jun Luo
General value functions (GVFs) in the reinforcement learning (RL) literature are long-term predictive summaries of the outcomes of agents following specific policies in the environment.
no code implementations • 18 Nov 2019 • Craig Sherstan, Shibhansh Dohare, James Macglashan, Johannes Günther, Patrick M. Pilarski
By using the timescale as one of the estimator's inputs we can estimate value for arbitrary timescales.
no code implementations • 15 Aug 2019 • Johannes Günther, Nadia M. Ady, Alex Kearney, Michael R. Dawson, Patrick M. Pilarski
Predictions and predictive knowledge have seen recent success in improving not only robot control but also other applications ranging from industrial process control to rehabilitation.
no code implementations • 30 May 2019 • Johannes Günther, Elias Reichensdörfer, Patrick M. Pilarski, Klaus Diepold
In this paper, we examine the utility of extending PID controllers with recurrent neural networks-namely, General Dynamic Neural Networks (GDNN); we show that GDNN (neural) PID controllers perform well on a range of control systems and highlight how they can be a scalable and interpretable option for control systems.