Search Results for author: Jingkun Ma

Found 1 papers, 0 papers with code

Breaking MLPerf Training: A Case Study on Optimizing BERT

no code implementations4 Feb 2024 YongDeok Kim, Jaehyung Ahn, Myeongwoo Kim, Changin Choi, Heejae Kim, Narankhuu Tuvshinjargal, Seungwon Lee, Yanzi Zhang, Yuan Pei, Xiongzhan Linghu, Jingkun Ma, Lin Chen, Yuehua Dai, Sungjoo Yoo

Speeding up the large-scale distributed training is challenging in that it requires improving various components of training including load balancing, communication, optimizers, etc.

Hyperparameter Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.