no code implementations • 18 Oct 2023 • Siddharth Singh, Zachary Sating, Abhinav Bhatele
The primary efficiency bottleneck in such optimizers is matrix inverse calculations in the preconditioning step, which are expensive to compute on GPUs.
Computational Efficiency Second-order methods