Search Results for author: Devon Jarvis

Found 5 papers, 1 paper with code

Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies

2 code implementations NeurIPS 2023 Michael Beukman, Devon Jarvis, Richard Klein, Steven James, Benjamin Rosman

To this end, we introduce a neural network architecture, the Decision Adapter, which generates the weights of an adapter module and conditions the behaviour of an agent on the context information.

reinforcement-learning

Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning

no code implementations 25 May 2022 Geraud Nangue Tasse, Devon Jarvis, Steven James, Benjamin Rosman

The agent can then flexibly compose them both logically and temporally to provably achieve temporal logic specifications in any regular language, such as regular fragments of linear temporal logic.

Continuous Control reinforcement-learning

Using Objective Bayesian Methods to Determine the Optimal Degree of Curvature within the Loss Landscape

no code implementations 25 Sep 2019 Devon Jarvis, Richard Klein, Benjamin Rosman

Whether the width of the basin of attraction surrounding a minimum in parameter space is a useful indicator of how well a model parametrization generalizes is a point of contention in the training of artificial neural networks. The dominant view is that wider areas in the landscape reflect better generalizability by the trained model.
