no code implementations • 13 Mar 2020 • Ramyad Hadidi, Bahar Asgari, Jiashen Cao, Younmin Bae, Da Eun Shim, Hyojong Kim, Sung-Kyu Lim, Michael S. Ryoo, Hyesoon Kim
To benefit from available compute resources with low communication overhead, we propose the first DNN parallelization method for reducing the communication overhead in a distributed system.
no code implementations • 16 Mar 2018 • Bahar Asgari, Saibal Mukhopadhyay, Sudhakar Yalamanchili
However, these efforts ignored maintaining a balance between bandwidth and compute rate of an architecture, with those of applications, which is a key principle in designing scalable large systems.
Hardware Architecture Performance