Search Results for author: Guo-qing Jiang

Found 1 paper, 0 papers with code

Accelerating Large Batch Training via Gradient Signal to Noise Ratio (GSNR)

no code implementations • 24 Sep 2023 • Guo-qing Jiang, Jinlong Liu, Zixiang Ding, Lin Guo, Wei Lin

As models for natural language processing (NLP), computer vision (CV), and recommendation systems (RS) demand rapidly growing computation, many GPUs/TPUs are run in parallel on a large batch (LB) to improve training throughput.

Recommendation Systems
