Search Results for author: Ruobing Han

Found 2 papers, 2 papers with code

Auto-Precision Scaling for Distributed Deep Learning

1 code implementation20 Nov 2019 Ruobing Han, James Demmel, Yang You

Our experimental results show that for many applications, APS can train state-of-the-art models by 8-bit gradients with no or only a tiny accuracy loss (<0. 05%).

Image Classification

Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes

1 code implementation19 Feb 2019 Peng Sun, Wansen Feng, Ruobing Han, Shengen Yan, Yonggang Wen

To address this problem, we propose a communication backend named GradientFlow for distributed DNN training, and employ a set of network optimization techniques.

Distributed, Parallel, and Cluster Computing

Cannot find the paper you are looking for? You can Submit a new open access paper.