no code implementations • 12 Jun 2024 • Massimiliano Lupo Pasini, Jong Youl Choi, Kshitij Mehta, Pei Zhang, David Rogers, Jonghyun Bae, Khaled Z. Ibrahim, Ashwin M. Aji, Karl W. Schulz, Jorda Polo, Prasanna Balaprakash
The HydraGNN architecture enables the GFM to achieve near-linear strong scaling performance using more than 2, 000 GPUs on Perlmutter and 16, 000 GPUs on Frontier.
1 code implementation • 18 Aug 2022 • Jonghyun Bae, Woohyeon Baek, Tae Jun Ham, Jae W. Lee
The decoding process of L3 is effectively parallelized on the accelerator, thus minimizing CPU intervention for data preparation during DNN training.
no code implementations • 1 Jan 2021 • Jonghyun Bae, Ji-Hoon Kim
Data augmentation tuned to datasets and tasks has had great success in various AI applications, such as computer vision, natural language processing, autonomous driving, and bioinformatics.