Search Results for author: Nick Hawes

Found 18 papers, 8 papers with code

Monte Carlo Tree Search with Boltzmann Exploration

2 code implementations NeurIPS 2023 Michael Painter, Mohamed Baioumy, Nick Hawes, Bruno Lacerda

Monte-Carlo Tree Search (MCTS) methods, such as Upper Confidence Bound applied to Trees (UCT), are instrumental to automated planning techniques.

Game of Go

Effects of Explanation Specificity on Passengers in Autonomous Driving

no code implementations2 Jul 2023 Daniel Omeiza, Raunak Bhattacharyya, Nick Hawes, Marina Jirotka, Lars Kunze

In this paper, we investigate the effects of natural language explanations' specificity on passengers in autonomous driving.

Autonomous Driving Explanation Generation +1

A Framework for Learning from Demonstration with Minimal Human Effort

1 code implementation15 Jun 2023 Marc Rigter, Bruno Lacerda, Nick Hawes

In this setting we address reinforcement learning, and learning from demonstration, where there is a cost associated with human time.

reinforcement-learning

DITTO: Offline Imitation Learning with World Models

no code implementations6 Feb 2023 Branton DeMoss, Paul Duckworth, Nick Hawes, Ingmar Posner

We propose DITTO, an offline imitation learning algorithm which uses world models and on-policy reinforcement learning to addresses the problem of covariate shift, without access to an oracle or any additional online interactions.

Imitation Learning reinforcement-learning +1

RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning

2 code implementations26 Apr 2022 Marc Rigter, Bruno Lacerda, Nick Hawes

Model-based algorithms, which learn a model of the environment from the dataset and perform conservative policy optimisation within that model, have emerged as a promising approach to this problem.

Offline RL reinforcement-learning +1

Planning for Risk-Aversion and Expected Value in MDPs

1 code implementation25 Oct 2021 Marc Rigter, Paul Duckworth, Bruno Lacerda, Nick Hawes

This motivates us to propose a lexicographic approach which minimises the expected cost subject to the constraint that the CVaR of the total cost is optimal.

On Solving a Stochastic Shortest-Path Markov Decision Process as Probabilistic Inference

no code implementations13 Sep 2021 Mohamed Baioumy, Bruno Lacerda, Paul Duckworth, Nick Hawes

Previous work on planning as active inference addresses finite horizon problems and solutions valid for online planning.

valid

Towards Stochastic Fault-tolerant Control using Precision Learning and Active Inference

no code implementations13 Sep 2021 Mohamed Baioumy, Corrado Pezzato, Carlos Hernandez Corbato, Nick Hawes, Riccardo Ferrari

This work presents a fault-tolerant control scheme for sensory faults in robotic manipulators based on active inference.

Minimax Regret Optimisation for Robust Planning in Uncertain Markov Decision Processes

no code implementations8 Dec 2020 Marc Rigter, Bruno Lacerda, Nick Hawes

We propose a dynamic programming algorithm that utilises the regret Bellman equation, and show that it optimises minimax regret exactly for UMDPs with independent uncertainties.

Active Inference for Integrated State-Estimation, Control, and Learning

1 code implementation12 May 2020 Mohamed Baioumy, Paul Duckworth, Bruno Lacerda, Nick Hawes

This work presents an approach for control, state-estimation and learning model (hyper)parameters for robotic manipulators.

Robotics

Convex Hull Monte-Carlo Tree Search

no code implementations9 Mar 2020 Michael Painter, Bruno Lacerda, Nick Hawes

This work investigates Monte-Carlo planning for agents in stochastic environments, with multiple objectives.

Multi-Armed Bandits

Artificial Intelligence for Long-Term Robot Autonomy: A Survey

no code implementations13 Jul 2018 Lars Kunze, Nick Hawes, Tom Duckett, Marc Hanheide, Tomáš Krajník

Autonomous systems will play an essential role in many applications across diverse domains including space, marine, air, field, road, and service robotics.

Simultaneous Task Allocation and Planning Under Uncertainty

no code implementations7 Mar 2018 Fatma Faruq, Bruno Lacerda, Nick Hawes, David Parker

We propose novel techniques for task allocation and planning in multi-robot systems operating in uncertain environments.

Learning Deep Visual Object Models From Noisy Web Data: How to Make it Work

1 code implementation IEEE Xplore: 2017 Nizar Massouh, Francesca Babiloni, Tatiana Tommasi, Jay Young, Nick Hawes, Barbara Caputo

We contribute to this research thread with two findings: (1) a study correlating a given level of noisily labels to the expected drop in accuracy, for two deep architectures, on two different types of noise, that clearly identifies GoogLeNet as a suitable architecture for learning from Web data; (2) a recipe for the creation of Web datasets with minimal noise and maximum visual variability, based on a visual and natural language processing concept expansion strategy.

Object Object Categorization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.