Search Results for author: XuanYu Wang

Found 1 papers, 0 papers with code

Demystifying Workload Imbalances in Large Transformer Model Training over Variable-length Sequences

no code implementations10 Dec 2024 Haoyang Li, Fangcheng Fu, Sheng Lin, Hao Ge, XuanYu Wang, Jiawen Niu, Jie Jiang, Bin Cui

To optimize large Transformer model training, efficient parallel computing and advanced data management are essential.

Management

Cannot find the paper you are looking for? You can Submit a new open access paper.