1 code implementation • 8 Aug 2022 • Jiarui Fang, Geng Zhang, Jiatong Han, Shenggui Li, Zhengda Bian, Yongbin Li, Jin Liu, Yang You
Deep learning recommendation models (DLRMs) are widely deployed at Internet companies.
1 code implementation • 28 Oct 2021 • Shenggui Li, Hongxin Liu, Zhengda Bian, Jiarui Fang, Haichen Huang, Yuliang Liu, Boxiang Wang, Yang You
The success of Transformer models has pushed the deep learning model scale to billions of parameters.
no code implementations • 8 Aug 2021 • Zhengda Bian, Shenggui Li, Wei Wang, Yang You
ONES automatically manages the elasticity of each job based on the training batch size, so as to maximize GPU utilization and improve scheduling efficiency.
no code implementations • 30 May 2021 • Boxiang Wang, Qifan Xu, Zhengda Bian, Yang You
It increases efficiency by reducing communication overhead and lowers the memory required for each GPU.
no code implementations • 30 May 2021 • Zhengda Bian, Qifan Xu, Boxiang Wang, Yang You
Our work is the first to introduce 3-dimensional model parallelism for expediting huge language models.