Stagewise Enlargement of Batch Size for SGD-based Learning

26 Feb 2020Shen-Yi ZhaoYin-Peng XieWu-Jun Li

Existing research shows that the batch size can seriously affect the performance of stochastic gradient descent~(SGD) based learning, including training speed and generalization ability. A larger batch size typically results in less parameter updates... (read more)

PDF Abstract


No code implementations yet. Submit your code now


Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.