no code implementations • 26 May 2020 • Colin Bellinger, Rory Coles, Mark Crowley, Isaac Tamblyn
Our empirical evaluation demonstrates that Amrl-Q agents are able to learn a policy and state estimator in parallel during online training.
1 code implementation • 15 Apr 2020 • Colin Bellinger, Rory Coles, Mark Crowley, Isaac Tamblyn
Reinforcement learning (RL) has been demonstrated to have great potential in many applications of scientific discovery and design.
1 code implementation • 20 Mar 2019 • Chris Beeler, Uladzimir Yahorau, Rory Coles, Kyle Mills, Stephen Whitelam, Isaac Tamblyn
Gradient-based reinforcement learning is able to learn the Stirling cycle, whereas an evolutionary approach achieves the optimal Carnot cycle.