NeurIPS 2021 • Zachary Nado, Justin M. Gilmer, Christopher J. Shallue, Rohan Anil, George E. Dahl
The LARS and LAMB optimizers have recently been proposed for training neural networks faster with large batch sizes.
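For context on what LARS does, here is a minimal NumPy sketch of its layer-wise scaling rule: each layer's update is rescaled by a trust ratio proportional to `||w|| / ||g + wd*w||`. Momentum is omitted, and `trust_coeff` and `weight_decay` are illustrative values, not settings taken from this paper.

```python
import numpy as np

def lars_trust_ratio(weights, grad, weight_decay=1e-4, trust_coeff=0.001):
    """Layer-wise LARS trust ratio: scales the local learning rate by
    ||w|| / ||g + wd*w|| so large-batch updates stay proportionate per layer."""
    w_norm = np.linalg.norm(weights)
    g_norm = np.linalg.norm(grad + weight_decay * weights)
    if w_norm == 0.0 or g_norm == 0.0:
        return 1.0  # fall back to the unscaled global learning rate
    return trust_coeff * w_norm / g_norm

def lars_step(weights, grad, lr=1.0, weight_decay=1e-4, trust_coeff=0.001):
    # Scale the global lr by this layer's trust ratio, then take a plain SGD step.
    local_lr = lr * lars_trust_ratio(weights, grad, weight_decay, trust_coeff)
    return weights - local_lr * (grad + weight_decay * weights)
```

In a full implementation the trust ratio is computed independently for every layer's parameter tensor and combined with momentum; LAMB applies an analogous per-layer normalization on top of Adam-style updates.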