Search Results for author: Warren B. Powell

Found 13 papers, 0 papers with code

Optimal Learning for Sequential Decisions in Laboratory Experimentation

no code implementations11 Apr 2020 Kristopher Reyes, Warren B. Powell

The process of discovery in the physical, biological and medical sciences can be painstakingly slow.

On State Variables, Bandit Problems and POMDPs

no code implementations14 Feb 2020 Warren B. Powell

We describe our canonical framework that models {\it any} sequential decision problem, and present our definition of state variables that allows us to claim: Any properly modeled sequential decision problem is Markovian.

Active Learning

Zeroth-order Stochastic Compositional Algorithms for Risk-Aware Learning

no code implementations19 Dec 2019 Dionysios S. Kalogerias, Warren B. Powell

We then present a complete analysis of the $\textit{Free-MESSAGE}^{p}$ algorithm, which establishes convergence in a user-tunable neighborhood of the optimal solutions of the original problem for convex costs, as well as explicit convergence rates for convex, weakly convex, and strongly convex costs, and in a unified way.

Stochastic Optimization

From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions

no code implementations7 Dec 2019 Warren B. Powell

We focus on two of the most important fields: stochastic optimal control, with its roots in deterministic optimal control, and reinforcement learning, with its roots in Markov decision processes.

reinforcement-learning Reinforcement Learning (RL) +1

Approximate Dynamic Programming for Planning a Ride-Sharing System using Autonomous Fleets of Electric Vehicles

no code implementations18 Oct 2018 Lina Al-Kanj, Juliana Nascimento, Warren B. Powell

At the same time, electric vehicles are quickly emerging as a next-generation technology that is cost effective, in addition to offering the benefits of reducing the carbon footprint.

Autonomous Vehicles

Recursive Optimization of Convex Risk Measures: Mean-Semideviation Models

no code implementations2 Apr 2018 Dionysios S. Kalogerias, Warren B. Powell

2) Assuming a strongly convex cost, we show that, for fixed semideviation order $p>1$ and for $\epsilon\in\left[0, 1\right)$, the MESSAGEp algorithm achieves a squared-${\cal L}_{2}$ solution suboptimality rate of the order of ${\cal O}(n^{-\left(1-\epsilon\right)/2})$ iterations, where, for $\epsilon>0$, pathwise convergence is simultaneously guaranteed.

Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds

no code implementations20 Apr 2017 Daniel R. Jiang, Lina Al-Kanj, Warren B. Powell

Monte Carlo Tree Search (MCTS), most famously used in game-play artificial intelligence (e. g., the game of Go), is a well-known strategy for constructing approximate solutions to sequential decision problems.

Game of Go

Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models

no code implementations22 Nov 2016 Xinyu He, Warren B. Powell

We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters.

Stochastic Optimization

The Information-Collecting Vehicle Routing Problem: Stochastic Optimization for Emergency Storm Response

no code implementations18 May 2016 Lina Al-Kanj, Warren B. Powell, Belgacem Bouzaiene-Ayari

Utilities face the challenge of responding to power outages due to storms and ice damage, but most power grids are not equipped with sensors to pinpoint the precise location of the faults causing the outage.

Stochastic Optimization

Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures

no code implementations7 Sep 2015 Daniel R. Jiang, Warren B. Powell

In this paper, we consider a finite-horizon Markov decision process (MDP) for which the objective at each stage is to minimize a quantile-based risk measure (QBRM) of the sequence of future costs; we call the overall objective a dynamic quantile-based risk measure (DQBRM).

A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model

no code implementations6 Aug 2015 Yan Li, Kristofer G. Reyes, Jorge Vazquez-Anderson, Yingfei Wang, Lydia M. Contreras, Warren B. Powell

We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules.

A New Optimal Stepsize For Approximate Dynamic Programming

no code implementations10 Jul 2014 Ilya O. Ryzhov, Peter I. Frazier, Warren B. Powell

Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems.

Management

Least Squares Policy Iteration with Instrumental Variables vs. Direct Policy Search: Comparison Against Optimal Benchmarks Using Energy Storage

no code implementations4 Jan 2014 Warren R. Scott, Warren B. Powell, Somayeh Moazehi

We address several of its enhancements, namely, Bellman error minimization using instrumental variables, least-squares projected Bellman error minimization, and projected Bellman error minimization using instrumental variables.

Cannot find the paper you are looking for? You can Submit a new open access paper.