no code implementations • 11 Apr 2020 • Kristopher Reyes, Warren B. Powell
The process of discovery in the physical, biological and medical sciences can be painstakingly slow.
no code implementations • 14 Feb 2020 • Warren B. Powell
We describe our canonical framework that models {\it any} sequential decision problem, and present our definition of state variables that allows us to claim: Any properly modeled sequential decision problem is Markovian.
no code implementations • 19 Dec 2019 • Dionysios S. Kalogerias, Warren B. Powell
We then present a complete analysis of the $\textit{Free-MESSAGE}^{p}$ algorithm, which establishes convergence in a user-tunable neighborhood of the optimal solutions of the original problem for convex costs, as well as explicit convergence rates for convex, weakly convex, and strongly convex costs, and in a unified way.
no code implementations • 7 Dec 2019 • Warren B. Powell
We focus on two of the most important fields: stochastic optimal control, with its roots in deterministic optimal control, and reinforcement learning, with its roots in Markov decision processes.
no code implementations • 18 Oct 2018 • Lina Al-Kanj, Juliana Nascimento, Warren B. Powell
At the same time, electric vehicles are quickly emerging as a next-generation technology that is cost effective, in addition to offering the benefits of reducing the carbon footprint.
no code implementations • 2 Apr 2018 • Dionysios S. Kalogerias, Warren B. Powell
2) Assuming a strongly convex cost, we show that, for fixed semideviation order $p>1$ and for $\epsilon\in\left[0, 1\right)$, the MESSAGEp algorithm achieves a squared-${\cal L}_{2}$ solution suboptimality rate of the order of ${\cal O}(n^{-\left(1-\epsilon\right)/2})$ iterations, where, for $\epsilon>0$, pathwise convergence is simultaneously guaranteed.
no code implementations • 20 Apr 2017 • Daniel R. Jiang, Lina Al-Kanj, Warren B. Powell
Monte Carlo Tree Search (MCTS), most famously used in game-play artificial intelligence (e. g., the game of Go), is a well-known strategy for constructing approximate solutions to sequential decision problems.
no code implementations • 22 Nov 2016 • Xinyu He, Warren B. Powell
We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters.
no code implementations • 18 May 2016 • Lina Al-Kanj, Warren B. Powell, Belgacem Bouzaiene-Ayari
Utilities face the challenge of responding to power outages due to storms and ice damage, but most power grids are not equipped with sensors to pinpoint the precise location of the faults causing the outage.
no code implementations • 7 Sep 2015 • Daniel R. Jiang, Warren B. Powell
In this paper, we consider a finite-horizon Markov decision process (MDP) for which the objective at each stage is to minimize a quantile-based risk measure (QBRM) of the sequence of future costs; we call the overall objective a dynamic quantile-based risk measure (DQBRM).
no code implementations • 6 Aug 2015 • Yan Li, Kristofer G. Reyes, Jorge Vazquez-Anderson, Yingfei Wang, Lydia M. Contreras, Warren B. Powell
We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules.
no code implementations • 10 Jul 2014 • Ilya O. Ryzhov, Peter I. Frazier, Warren B. Powell
Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems.
no code implementations • 4 Jan 2014 • Warren R. Scott, Warren B. Powell, Somayeh Moazehi
We address several of its enhancements, namely, Bellman error minimization using instrumental variables, least-squares projected Bellman error minimization, and projected Bellman error minimization using instrumental variables.