no code implementations • 4 Mar 2019 • Sanghamitra Dutta, Ziqian Bai, Tze Meng Low, Pulkit Grover
This work proposes the first strategy to make distributed training of neural networks resilient to computing errors, a problem that has remained unsolved despite being first posed in 1956 by von Neumann.
no code implementations • 27 Nov 2018 • Sanghamitra Dutta, Ziqian Bai, Haewon Jeong, Tze Meng Low, Pulkit Grover
First, we propose a novel coded matrix multiplication technique called Generalized PolyDot codes that advances on existing methods for coded matrix multiplication under storage and communication constraints.