no code implementations • 22 Apr 2022 • Rui Ma, Evangelos Georganas, Alexander Heinecke, Andrew Boutros, Eriko Nurvitadhi
The overhead of these collective communication operations can bottleneck the performance of a distributed AI training system, and the effect becomes more pronounced as the number of nodes increases.
no code implementations • 14 Dec 2020 • Andrew Boutros, Mathew Hall, Nicolas Papernot, Vaughn Betz
We find that even when using the strongest attacker circuit, the prediction accuracy of the DL accelerator is not compromised when running at its safe operating frequency.