no code implementations • 25 Sep 2019 • Yue Zhao, Xiangsheng Huang, Ludan Kou
Although adaptive algorithms have achieved significant success in training deep neural networks with faster training speed, they tend to have poor generalization performance compared to SGD with Momentum(SGDM).