Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

NeurIPS 2018 Timur GaripovPavel IzmailovDmitrii PodoprikhinDmitry VetrovAndrew Gordon Wilson

The loss functions of deep neural networks are complex and their geometric properties are not well understood. We show that the optima of these complex loss functions are in fact connected by simple curves over which training and test accuracy are nearly constant... (read more)

PDF Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.