1 code implementation • 11 Nov 2022 • Xuan Rao, Bo Zhao, Xiaosong Yi, Derong Liu
In neural architecture search (NAS) methods based on latent space optimization (LSO), a deep generative model is trained to embed discrete neural architectures into a continuous latent space.
no code implementations • 11 Oct 2014 • Biao Luo, Derong Liu, Ting-Wen Huang
By introducing the Q-function for continuous-time systems, policy iteration based QL (PIQL) and value iteration based QL (VIQL) algorithms are proposed for learning the optimal control policy from real system data rather than using mathematical system model.
no code implementations • 2 Nov 2013 • Biao Luo, Huai-Ning Wu, Ting-Wen Huang, Derong Liu
Firstly, a model-free policy iteration algorithm is derived for constrained optimal control problem and its convergence is proved, which can learn the solution of HJB equation and optimal control policy without requiring any knowledge of system mathematical model.