Search Results for author: Sean Treichler

Found 4 papers, 1 papers with code

Exascale Deep Learning for Scientific Inverse Problems

no code implementations24 Sep 2019 Nouamane Laanait, Joshua Romero, Junqi Yin, M. Todd Young, Sean Treichler, Vitalii Starchenko, Albina Borisevich, Alex Sergeev, Michael Matheson

We introduce novel communication strategies in synchronous distributed Deep Learning consisting of decentralized gradient reduction orchestration and computational graph-aware grouping of gradient tensors.

Materials Imaging

Task Bench: A Parameterized Benchmark for Evaluating Parallel Runtime Performance

no code implementations15 Aug 2019 Elliott Slaughter, Wei Wu, Yuankun Fu, Legend Brandenburg, Nicolai Garcia, Wilhem Kautz, Emily Marx, Kaleb S. Morris, Wonchan Lee, Qinglei Cao, George Bosilca, Seema Mirchandaney, Sean Treichler, Patrick McCormick, Alex Aiken

We present Task Bench, a parameterized benchmark designed to explore the performance of parallel and distributed programming systems under a variety of application scenarios.

Distributed, Parallel, and Cluster Computing

Exascale Deep Learning for Climate Analytics

3 code implementations3 Oct 2018 Thorsten Kurth, Sean Treichler, Joshua Romero, Mayur Mudigonda, Nathan Luehr, Everett Phillips, Ankur Mahesh, Michael Matheson, Jack Deslippe, Massimiliano Fatica, Prabhat, Michael Houston

The Tiramisu network scales to 5300 P100 GPUs with a sustained throughput of 21. 0 PF/s and parallel efficiency of 79. 0%.

Distributed, Parallel, and Cluster Computing

Cannot find the paper you are looking for? You can Submit a new open access paper.