no code implementations • 25 Feb 2023 • Zhifa Ke, Junyu Zhang, Zaiwen Wen
Under mild conditions, non-asymptotic finite-sample convergence to the globally optimal Q function is derived for various nonlinear function approximations.
Offline RL Q-Learning