Search Results for author: Warren B. Powell

Found 13 papers, 0 papers with code

Optimal Learning for Sequential Decisions in Laboratory Experimentation

no code implementations • 11 Apr 2020 • Kristopher Reyes, Warren B. Powell

The process of discovery in the physical, biological and medical sciences can be painstakingly slow.

Paper
Add Code

On State Variables, Bandit Problems and POMDPs

no code implementations • 14 Feb 2020 • Warren B. Powell

We describe our canonical framework that models {\it any} sequential decision problem, and present our definition of state variables that allows us to claim: Any properly modeled sequential decision problem is Markovian.

Active Learning

Paper
Add Code

Zeroth-order Stochastic Compositional Algorithms for Risk-Aware Learning

no code implementations • 19 Dec 2019 • Dionysios S. Kalogerias, Warren B. Powell

We then present a complete analysis of the $\textit{Free-MESSAGE}^{p}$ algorithm, which establishes convergence in a user-tunable neighborhood of the optimal solutions of the original problem for convex costs, as well as explicit convergence rates for convex, weakly convex, and strongly convex costs, and in a unified way.

Stochastic Optimization

Paper
Add Code

From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions

no code implementations • 7 Dec 2019 • Warren B. Powell

We focus on two of the most important fields: stochastic optimal control, with its roots in deterministic optimal control, and reinforcement learning, with its roots in Markov decision processes.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Approximate Dynamic Programming for Planning a Ride-Sharing System using Autonomous Fleets of Electric Vehicles

no code implementations • 18 Oct 2018 • Lina Al-Kanj, Juliana Nascimento, Warren B. Powell

At the same time, electric vehicles are quickly emerging as a next-generation technology that is cost effective, in addition to offering the benefits of reducing the carbon footprint.

Autonomous Vehicles

Paper
Add Code

Recursive Optimization of Convex Risk Measures: Mean-Semideviation Models

no code implementations • 2 Apr 2018 • Dionysios S. Kalogerias, Warren B. Powell

2) Assuming a strongly convex cost, we show that, for fixed semideviation order $p>1$ and for $\epsilon\in\left[0, 1\right)$, the MESSAGEp algorithm achieves a squared-${\cal L}_{2}$ solution suboptimality rate of the order of ${\cal O}(n^{-\left(1-\epsilon\right)/2})$ iterations, where, for $\epsilon>0$, pathwise convergence is simultaneously guaranteed.

Paper
Add Code

Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds

no code implementations • 20 Apr 2017 • Daniel R. Jiang, Lina Al-Kanj, Warren B. Powell

Monte Carlo Tree Search (MCTS), most famously used in game-play artificial intelligence (e. g., the game of Go), is a well-known strategy for constructing approximate solutions to sequential decision problems.

Game of Go

Paper
Add Code

Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models

no code implementations • 22 Nov 2016 • Xinyu He, Warren B. Powell

We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters.

Stochastic Optimization

Paper
Add Code

The Information-Collecting Vehicle Routing Problem: Stochastic Optimization for Emergency Storm Response

no code implementations • 18 May 2016 • Lina Al-Kanj, Warren B. Powell, Belgacem Bouzaiene-Ayari

Utilities face the challenge of responding to power outages due to storms and ice damage, but most power grids are not equipped with sensors to pinpoint the precise location of the faults causing the outage.

Stochastic Optimization

Paper
Add Code

Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures

no code implementations • 7 Sep 2015 • Daniel R. Jiang, Warren B. Powell

In this paper, we consider a finite-horizon Markov decision process (MDP) for which the objective at each stage is to minimize a quantile-based risk measure (QBRM) of the sequence of future costs; we call the overall objective a dynamic quantile-based risk measure (DQBRM).

Paper
Add Code

A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model

no code implementations • 6 Aug 2015 • Yan Li, Kristofer G. Reyes, Jorge Vazquez-Anderson, Yingfei Wang, Lydia M. Contreras, Warren B. Powell

We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules.

Paper
Add Code

A New Optimal Stepsize For Approximate Dynamic Programming

no code implementations • 10 Jul 2014 • Ilya O. Ryzhov, Peter I. Frazier, Warren B. Powell

Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems.

Management

Paper
Add Code

Least Squares Policy Iteration with Instrumental Variables vs. Direct Policy Search: Comparison Against Optimal Benchmarks Using Energy Storage

no code implementations • 4 Jan 2014 • Warren R. Scott, Warren B. Powell, Somayeh Moazehi

We address several of its enhancements, namely, Bellman error minimization using instrumental variables, least-squares projected Bellman error minimization, and projected Bellman error minimization using instrumental variables.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.