1 code implementation • 10 May 2019 • Anand Jayarajan, Jinliang Wei, Garth Gibson, Alexandra Fedorova, Gennady Pekhimenko
Data parallel training is widely used for scaling distributed deep neural network (DNN) training.