no code implementations • NeurIPS 2019 • Bin Hu, Usman Ahmed Syed
For both the IID and Markov noise cases, we show that the evolution of some augmented versions of the mean and covariance matrix of the TD estimation error exactly follows the trajectory of a deterministic linear time-invariant (LTI) dynamical system.