no code implementations • 13 Nov 2018 • Hiroaki Mikami, Hisahiro Suganuma, Pongsakorn U-chupala, Yoshiki Tanaka, Yuichi Kageyama
Scaling distributed deep learning to massive GPU clusters is challenging due to the instability of large mini-batch training and the overhead of gradient synchronization.
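To make the synchronization overhead concrete, here is a minimal sketch of the synchronous gradient all-reduce step that data-parallel training performs after every backward pass. It assumes PyTorch's `torch.distributed` API with an already-initialized process group; the function name and model argument are hypothetical, not from the paper.

```python
# Minimal sketch of synchronous gradient all-reduce (assumes PyTorch and
# an initialized process group, e.g. dist.init_process_group("nccl")).
# Names here are illustrative, not the authors' implementation.
import torch
import torch.distributed as dist

def synchronize_gradients(model: torch.nn.Module) -> None:
    """Average gradients across all workers after the backward pass."""
    world_size = dist.get_world_size()
    for param in model.parameters():
        if param.grad is not None:
            # Sum each gradient tensor across every GPU, then divide
            # so all workers hold the same mean gradient.
            dist.all_reduce(param.grad, op=dist.ReduceOp.SUM)
            param.grad /= world_size
```

This collective runs once per iteration for every parameter tensor, so its cost grows with both model size and cluster size, which is why it becomes a bottleneck at massive GPU counts.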