Search Results for author: Baiyu Su

Found 1 papers, 1 papers with code

Adam through a Second-Order Lens

1 code implementation23 Oct 2023 Ross M. Clarke, Baiyu Su, José Miguel Hernández-Lobato

Research into optimisation for deep learning is characterised by a tension between the computational efficiency of first-order, gradient-based methods (such as SGD and Adam) and the theoretical efficiency of second-order, curvature-based methods (such as quasi-Newton methods and K-FAC).

Computational Efficiency Second-order methods

Cannot find the paper you are looking for? You can Submit a new open access paper.