no code implementations • 6 Sep 2023 • Eyal Neuman, Wolfgang Stockinger, Yufei Zhang
We show that a trader who tries to minimise her execution costs by using a greedy strategy based purely on the estimated propagator will encounter suboptimality for two reasons: so-called spurious correlation between the trading strategy and the estimator, and intrinsic uncertainty resulting from a biased cost functional.
no code implementations • 22 Mar 2022 • Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang
Despite the popularity of policy gradient methods in the reinforcement learning community, a provably convergent policy gradient method for continuous space-time control problems with nonlinear state dynamics has been elusive.