no code implementations • 15 Nov 2017 • Xiaocheng Li, Huaiyang Zhong, Margaret L. Brandeau
In this paperwe consider the problem of optimizing the quantiles of the cumulative rewards of a Markov decision process(MDP), which we refer to as a quantile Markov decision process (QMDP).
no code implementations • 15 Nov 2017 • Huaiyang Zhong, Xiaocheng Li, David Lobell, Stefano Ermon, Margaret L. Brandeau
Eradicating hunger and malnutrition is a key development goal of the 21st century.