Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization

29 Nov 2019 · Brendan Ruff, Taylor Beck, Joscha Bach

Deep convolutional neural networks are known to be unstable during training at high learning rates unless normalization techniques are employed. Normalizing weights or activations allows the use of higher learning rates, resulting in faster convergence and higher test accuracy...

