no code implementations • 17 Jan 2020 • Goran Nakerst, John Brennan, Masudul Haque
In this work, we show that the algorithm can be improved by extending this `acceleration' --- by using the gradient at an estimated position several steps ahead rather than just one step ahead.