Search Results for author: Neil Walton

Found 9 papers, 2 papers with code

Accelerating Look-ahead in Bayesian Optimization: Multilevel Monte Carlo is All you Need

1 code implementation3 Feb 2024 Shangda Yang, Vitaly Zankin, Maximilian Balandat, Stefan Scherer, Kevin Carlberg, Neil Walton, Kody J. H. Law

We leverage multilevel Monte Carlo (MLMC) to improve the performance of multi-step look-ahead Bayesian optimization (BO) methods that involve nested expectations and maximizations.

Bayesian Optimization

Exponential Concentration in Stochastic Approximation

no code implementations15 Aug 2022 Kody Law, Neil Walton, Shangda Yang

We apply our results to several different Stochastic Approximation algorithms, specifically Projected Stochastic Gradient Descent, Kiefer-Wolfowitz and Stochastic Frank-Wolfe algorithms.

Learning and Information in Stochastic Networks and Queues

no code implementations18 May 2021 Neil Walton, Kuang Xu

We review the role of information and learning in the stability and optimization of queueing systems.

Decision Making reinforcement-learning +1

Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems

no code implementations21 Apr 2021 Alvaro Cabrejas-Egea, Raymond Zhang, Neil Walton

Recently, Intelligent Transportation Systems are leveraging the power of increased sensory coverage and computing power to deliver data-intensive solutions achieving higher levels of performance than traditional systems.

Q-Learning reinforcement-learning +1

An Adiabatic Theorem for Policy Tracking with TD-learning

no code implementations24 Oct 2020 Neil Walton

We evaluate the ability of temporal difference learning to track the reward function of a policy as it changes over time.

Q-Learning

Fast Approximate Bayesian Contextual Cold Start Learning (FAB-COST)

1 code implementation18 Aug 2020 Jack R. McKenzie, Peter A. Appleby, Thomas House, Neil Walton

Cold-start is a notoriously difficult problem which can occur in recommendation systems, and arises when there is insufficient information to draw inferences for users or items.

Recommendation Systems

Regret Analysis of a Markov Policy Gradient Algorithm for Multi-arm Bandits

no code implementations20 Jul 2020 Denis Denisov, Neil Walton

We consider a policy gradient algorithm applied to a finite-arm bandit problem with Bernoulli rewards.

A Short Note on Soft-max and Policy Gradients in Bandits Problems

no code implementations20 Jul 2020 Neil Walton

This is a short communication on a Lyapunov function argument for softmax in bandit problems.

reinforcement-learning Reinforcement Learning (RL)

Adaptive Pricing in Insurance: Generalized Linear Models and Gaussian Process Regression Approaches

no code implementations2 Jul 2019 Yuqing Zhang, Neil Walton

We develop two pricing models: an adaptive Generalized Linear Model (GLM) and an adaptive Gaussian Process (GP) regression model.

Gaussian Processes Management +1

Cannot find the paper you are looking for? You can Submit a new open access paper.