Search Results for author: Max Grossman

Found 1 paper, 0 papers with code

Data-parallel distributed training of very large models beyond GPU capacity

29 Nov 2018 • Samuel Matzek, Max Grossman, Minsik Cho, Anar Yusifov, Bryant Nelson, Amit Juneja

GPUs have limited memory, which makes it difficult to train wide and/or deep models: such models can cause the training process to run out of memory.
