no code implementations • 16 Nov 2022 • Siyuan Lu, Chenchen Zhou, Keli Xie, Jun Lin, Zhongfeng Wang
Based on ELBERT, an innovative method to accelerate text processing on the GPU platform is developed, solving the difficult problem of making the early exit mechanism work more effectively with a large input batch size.