Search Results for author: Max Grossman

Found 1 papers, 0 papers with code

Data-parallel distributed training of very large models beyond GPU capacity

no code implementations29 Nov 2018 Samuel Matzek, Max Grossman, Minsik Cho, Anar Yusifov, Bryant Nelson, Amit Juneja

GPUs have limited memory and it is difficult to train wide and/or deep models that cause the training process to go out of memory.

Cannot find the paper you are looking for? You can Submit a new open access paper.