Search Results for author: Tim Pearce

Found 17 papers, 12 papers with code

Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing

no code implementations26 Oct 2023 Stephen Mak, Liming Xu, Tim Pearce, Michael Ostroumov, Alexandra Brintrup

Our contribution is that our decentralised approach is both scalable and considers the self-interested nature of companies.

reinforcement-learning

Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach

no code implementations26 Oct 2023 Stephen Mak, Liming Xu, Tim Pearce, Michael Ostroumov, Alexandra Brintrup

Our contribution is that we are the first to consider both the route allocation problem and gain sharing problem simultaneously - without access to the expensive characteristic function.

Multi-agent Reinforcement Learning reinforcement-learning

Imitating Human Behaviour with Diffusion Models

1 code implementation25 Jan 2023 Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments.

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

1 code implementation12 Jul 2022 Wentse Chen, Shiyu Huang, Yuan Chiang, Tim Pearce, Wei-Wei Tu, Ting Chen, Jun Zhu

We propose Diversity-Guided Policy Optimization (DGPO), an on-policy algorithm that discovers multiple strategies for solving a given task.

reinforcement-learning Reinforcement Learning (RL)

Censored Quantile Regression Neural Networks for Distribution-Free Survival Analysis

1 code implementation26 May 2022 Tim Pearce, Jong-Hyeon Jeong, Yichen Jia, Jun Zhu

To offer theoretical insight into our algorithm, we show firstly that it can be interpreted as a form of expectation-maximisation, and secondly that it exhibits a desirable `self-correcting' property.

regression Survival Analysis

Bayesian Autoencoders: Analysing and Fixing the Bernoulli likelihood for Out-of-Distribution Detection

1 code implementation28 Jul 2021 Bang Xiang Yong, Tim Pearce, Alexandra Brintrup

After an autoencoder (AE) has learnt to reconstruct one dataset, it might be expected that the likelihood on an out-of-distribution (OOD) input would be low.

Out-of-Distribution Detection

Understanding Softmax Confidence and Uncertainty

no code implementations9 Jun 2021 Tim Pearce, Alexandra Brintrup, Jun Zhu

It is often remarked that neural networks fail to increase their uncertainty when predicting on data far from the training distribution.

Out of Distribution (OOD) Detection

Counter-Strike Deathmatch with Large-Scale Behavioural Cloning

2 code implementations9 Apr 2021 Tim Pearce, Jun Zhu

This paper describes an AI agent that plays the popular first-person-shooter (FPS) video game `Counter-Strike; Global Offensive' (CSGO) from pixel input.

Behavioural cloning FPS Games

Structured Weight Priors for Convolutional Neural Networks

1 code implementation12 Jul 2020 Tim Pearce, Andrew Y. K. Foong, Alexandra Brintrup

This paper explores the benefits of adding structure to weight priors.

Avoiding Kernel Fixed Points: Computing with ELU and GELU Infinite Networks

1 code implementation20 Feb 2020 Russell Tsuchida, Tim Pearce, Chris van der Heide, Fred Roosta, Marcus Gallagher

Secondly, and more generally, we analyse the fixed-point dynamics of iterated kernels corresponding to a broad range of activation functions.

Gaussian Processes

Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions

1 code implementation15 May 2019 Tim Pearce, Russell Tsuchida, Mohamed Zaki, Alexandra Brintrup, Andy Neely

A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e. g. summing a periodic and linear kernel can capture seasonal variation with a long term trend.

reinforcement-learning Reinforcement Learning (RL)

Bayesian Neural Network Ensembles

no code implementations27 Nov 2018 Tim Pearce, Mohamed Zaki, Andy Neely

Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset.

Uncertainty in Neural Networks: Approximately Bayesian Ensembling

2 code implementations12 Oct 2018 Tim Pearce, Felix Leibfried, Alexandra Brintrup, Mohamed Zaki, Andy Neely

Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesian.

Bayesian Inference General Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.