1 code implementation • 25 Jul 2020 • Yosuke Oyama, Naoya Maruyama, Nikoli Dryden, Erin McCarthy, Peter Harrington, Jan Balewski, Satoshi Matsuoka, Peter Nugent, Brian Van Essen
We present scalable hybrid-parallel algorithms for training large-scale 3D convolutional neural networks.
1 code implementation • 13 Apr 2018 • Yosuke Oyama, Tal Ben-Nun, Torsten Hoefler, Satoshi Matsuoka
NVIDIA cuDNN is a low-level library that provides GPU kernels frequently used in deep learning.