no code implementations • 9 Apr 2021 • Zhou Zhai, Bin Gu, Heng Huang
To explore this problem, in this paper, we propose a new reinforcement learning based ZO algorithm (ZO-RL) with learning the sampling policy for generating the perturbations in ZO optimization instead of using random sampling.
no code implementations • 24 Dec 2019 • Zhou Zhai, Bin Gu, Xiang Li, Heng Huang
To address this challenge, in this paper, we propose two safe sample screening rules for RSVM based on the framework of concave-convex procedure (CCCP).