no code implementations • 8 Jun 2020 • W. Tao, Z. Pan, G. Wu, Q. Tao
The extrapolation strategy raised by Nesterov, which can accelerate the convergence rate of gradient descent methods by orders of magnitude when dealing with smooth convex objective, has led to tremendous success in training machine learning tasks.